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A  convenient  method  for  proving  weak  convergence  of  a  sequence 
of  non-Markovian  processes  to  a  jump-dif f usion  process  is 

proved.  Basically,  it  is  shown  that  the  limit  solves  the  martingale 
problem  of  Strook  and  varadhan.  The  proofs  are  relatively  simple, 
and  the  conditions  apparently  weaker  than  required  by  other  current 
methods  (in  particular,  for  limit  theorems  for  a  sequence  of  ordinary 
differential  equations  with  random  right  hand  sides) .  In  order  to 
illustrate  the  relative  ease  of  applicability  in  many  cases,  a  simple*  - 
proof  of  a  known  result  on  averaging  is  given. 
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1.  Introduction 


In  [1],  (2),  [3J,  Kurtz  developed  a  general  theory  for  the 

weak  convergence  of  a  sequence  ix^t-)  }  of  Inot  necessarily  Markov) 
processes  to  a  Markov  process  x(-).  The  usefulness  of  his  approach 
is  attested  to  by  the  applications  in  [4J,  [5J.  Kurtz's  method 

involved  a  general  approach  to  the  problem  of  semigroup  approxima¬ 
tion  and  using  it  in  (5)  we  were  able  to  improve  upon  the  approxima¬ 
tions  results  of  (e.g.)  [61,  [7)  with  both  simpler  proofs  and  better 

conditions.  Reference  [5)  dealt  with  the  weak  convergence  of  a 
sequence  of  solutions  to  ordinary  differential  equations  with  random 
right  hand  sides  to  jump-diffusion  process. 

In  this  paper,  some  of  the  remarks  and  ideas  on  proving 
tightness  in  [3)  are  exploited  to  get  a  result  of  the  same  type  in  a 
simpler  way.  The  general  theory  of  [3]  is  not  used.  Instead  of 
showing  weak  convergence  to  a  Markov  process,  we  show  weak  convergence 
to  a  solution  of  the  martingale  problem  of  Strook  and  Varadhan  [8J-[9]. 
Our  approach  has  several  advantages:  the  proofs  are  short  and 
straightforward,  less  care  is  required  in  the  choice  of  the  test 
functions  (Kurtz’s  dense  class  rJ)  ,  the  conditions  are  often  easier 
to  verify  -  especially  for  the  non-homogeneous  case  (e.g.,  we  do 
not  need  to  deal  with  the  density  of  (A-A)I/,  see  [3]).  Finally, 
the  extension  to  convergence  of  continuous  parameter  interpolations  of 
discrete  parameter  processes  is  easier  and  more  natural  in  that  we 
do  not  need  to  treat  the  interpolation  as  a  continuous  parameter 
process  -  but  only  look  at  it  at  the  "jump  points".  Also,  for  the 


-2 


discrete  parameter  case,  our  method  often  allows  the  perturbations4 
(f1")  of  the  test  function  f  to  be  obtained  by  relatively 
straightforward  means.  Some  additional  remarks  on  this  case  appear 
in  Section  4,  but  a  fuller  exposition  of  the  discrete  parameter 
case  will  be  given  in  a  subsequent  paper,  together  with  several 
applications. 

The  problem  is  set  up  in  Section  2  and  the  main  theorem  proved 
in  Section  3.  Some  remarks  on  tightness  are  made  in  Section  3,  the 
discrete  parameter  case  is  treated  in  Section  4  and  Section  5 
illustrates  the  simplicity  of  the  idea  by  giving  a  simpler  proof  of  a 
classical  problem  of  averaging.  One  possible  disadvantage  of  our 
approach  is  that  tightness  must  be  proved  first.  This  does  not 
seem  to  be  a  handicap  however,  since  in  many  applications, 
one  can  do  this  quite  easily  by  the  approximation  method  developed 
in  the  sequel  and  the  tightness  results  in  Kurtz's  paper  (3).  This 
is  illustrated  in  Theorem  5  or  by  the  way  tightness  is  proved  in 
(5),  (10],  where  Kurtz's  scheme  [3]  is  used. 

Since  writing  this  paper,  it  has  come  to  the  author's  attention 
that  Papanicolaou,  Strook  and  Varadhan  (13)  have  used  a  related 
method  to  get  a  variety  of  limit  theorems.  That  work  is  restricted 
to  a  purely  continuous  parameter  and  Markovian  framework  and,  except 
for  some  special  results  which  do  not  overlap  ours,  the  Markov 
driving  noise  (analogous  to  y ( • )  in  (3.6a))  is  required  to  have  a 
strong  ergodic  property.  Within  their  ergodic-Markov  framework. 


*We  use  t  -►  0  rather  than  n  ♦  ®  (as  in  (3])  to  index  the 
sequence. 


* 
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they  treat  a  wider  class  of  problems  than  we  do  here.  Our  xL{*)  are 
not  necessarily  Markovian,  and  various  ideas  of  Kurtz  [3]  are 
adopted  to  prove  the  requisite  tightness. 


2.  Problem  Formulation 

Let  { x *■  ( * )  }  denote  a  sequence  of  Rr  valued  processes  with 
paths  in  Dr[0,®)  (endowed  with  the  Skorokhod  topology),  and  all 

(w.l.o.g.)  defined  on  the  same  sample  space  (0,P,  **)  .  Let  { 
denote  an  increasing  sequence  of  sub  o-algebras  of  **  such  that 
(xL(s),  s  <  t)  is  ^  measurable.  Let  J/"  denote  the 
progressively  measurable  functions  with  respect  to  {  ) .  There 

are  progressively  measurable  versions  of  all  the  functions  introduced 
below,  and  we  assume  that  those  are  the  versions  used.  Define  y 
to  be  the  subset  of  y  for  which  sup  E|f (t) |  <  «.  Let 
SN*  {x:  I x |  <  N ) .  Let  C  ^  denote  an  increasing  sequence 

of  o-algebras  which  measures  x' (s) ,  s  <  t,  and  let  E^  denote  ex¬ 
pectation  conditioned  on  9^.  Define  the  operators  p-lim  and 


A1 

as 

in 

(3)  by: 

For  f  and  f 1  t  y, 

we  say  that 

p-lim  f "  c  f 

*  f 

6-0 

iff 

(2 

.1) 

holds. 

(2.2)  defines  A  and 

^(Ae)  . 

sup  E  |  f  ( t )  |  <  «°,  lim  E |  f  **  (t)  -  f  (t)  |  *  0  each  t; 
t ,  6  6-0 

f  c  £?(AC)  and  ALf  -  g  iff  for  f,  g  e  jT"  and  adapted 

to  {-**£)  and  g  p-right  continuous* , 

p-limlj  (E^f ( t ♦ 6 )  -  f(t))  -  g(t)]  -  0. 

6-0 

*For  q  t  y,  p-right  continuity  means  that  lim  E|g(t+6)  -  g(t) 1  * 
for  each  t  >  0. 


(2.1) 

(2.2) 


I 


0 
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Let  6  denote  the  space  of  continuous  bounded  real -valued 

r  ♦  a 

functions  on  R  *  R  which  are  zero  at  infinity,  *()  the 

^  Jl  fi  A  J 

subset  with  compact  support,  and  fc'0*  (resp.,  &  *  )  the  sub¬ 
set  of  (resp.,  £)  with  bounded  and  continuous  u 

t-partial  derivatives  and  0  x-partial  derivatives.  denotes 

the  Bor el  field  over  R^. 

Let  A  denote  the  diffusion  operator  l  b.(x,t)  * 

i  l  ax 

2 

2  “  a^(x,t)  dx  gx~  where  we  assume  that  the  coefficients  are 

continuous,  let  !(*,•)  denote  a  non-negative  real-valued  bounded 
continuous  function  on  R^  *  R* .  Let  u  (-,•#•)  be  a  function  on 
Rr  *  R*  *  05(1*^)  such  that  y(x,t,*)  is  a  measure  of  total  mass  unit. 

A 

(R^)  for  each  x,t,  and  such  that  for  each  f  (  >, 
f (x*y , t) u (x , t ,dy)  is  continuous  in  x,t. 

a  i  - 

Let  A  denote  the  operator  on  given  by 

Af(x,t)  *  Af (x, t)  ♦  X (x , t)  |  ( f (x+y , t )  -  f (x,t) )m (x.t,dy) . 

For  each  x  let  there  be  a  measure  Px  on  Dr[0,<*>)  such  that 
Px { x ( 0)  ■  x)  ■  1,  and 


(2.3)  for  each  T  <  ® ,  p  (sup  |x(t)|  <-)«!, 


and  which  is  assumed  to  be  the  unique  solution  to  the  martingale 
problem  of  Strook  and  Varadhan  (8J - [ 9] :  vis,  for  each  f(-)  t  £\'2 , 
the  function 


(2.4) 


Mf (t)  -  f (x(t) ,t)  -  f (x ( 0) 


,0)  -  ft 

J  n 


(jg  +  A) f (x (s) , s) ds 


is  a  P^  martingale.  By  the  uniqueness,  x  can  be  replaced  by  any 
random  variable  X  in  the  above. 

3 .  The  Convergence  Theorem 

Theorem  1  requires  tightness;  since  we  do  not  work  within  a 
semigroup  framework,  we  cannot  first  prove  convergence  of  finite 
dimensional  distributions  as  done  in  [3],  and  then  prove  tightness. 

For  many  problems  (as  will  be  seen)  this  is  no  barrier  to  effective 
use  of  our  approach.  It  is  often  much  easier  to  prove  tightness 
when  the  {xt(t)J  are  bounded  uniformly  in  e,t.  In  order  to 
exploit  this  fact,  we  introduce  a  sequence  of  truncations  of 
(x  (•))»  prove  convergence  for  each  truncation,  and  then  take 
limits  to  get  convergence  for  the  original  sequence.  Section  5 
illustrates  how  such  an  idea  can  be  used  for  at  least  one  class 
of  problems.  See  also  the  remark  after  the  proof  of  Theorem  1.  The 
truncation  method  seems  quite  natural  and  easy  to  use  for  the  classes 
of  problems  in  15)-(7].  In  fact,  the  general  method  for  creating  the 
truncations  x  ,N(-)  is  simply  to  stop  or  "slow  down"  xt(»)  when 
|x  (t)|>  N.  The  limits  of  these  truncations  (as  e  0)  determine  the 
limit  as  t  ♦  0  of  x^*),  under  our  uniqueness  hypothesis. 

Because  of  the  truncation  idea,  we  introduce  the  following  definition. 
Let  X ( * )  denote  the  process  defined  by  the  solution  to  the  martingale 
problem  (2.4)  with  initial  condition  X(0)  (i.e.,  with  measure 

PX(0)>.  Let  AN  denote  an  operator  of  the  form  of  A  with  co¬ 
efficients  aN (*,♦), bN (•»•), (•,•), (•,•»• )  satisfying  the 


conditions  in  the  paragraph  above  (2.3)  and  which  are  equal  to  the 
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coefficients  of  A  when  |x|  <  N.  Suppose  that  xN ( • )  is  a  process 
with  paths  in  Dr[0,»)  such  that  XN(0)  -  x ( 0 )  weakly  and  XN ( • ) 
solves  (not  necessarily  uniquely)  the  martingale  problem  (2.4)  -  with 
initial  condition  XN(0).  Then  XN(*)  is  called  an  N-truncation  of 
X(-)  . 


Theorem  1 .  Assume  the  conditions  of  Section  2.  For  each 
6  >  0,  let  (xl'N(  •) }  be  a  bounded* sequence  of  processes  with 
paths  in  Dr(0,<*>)  satisfying  x  'N(t)  =  xL(t)  for 

t  <  inf  {  s :  |  xL  (s)  |  >  N}.  Let  xc  (0)  X(0)  weakly.  If  xL(0)  >  N, 

C  ,  N  _  A 

set  x  (t)  =  0.  Let  rJ  be  a  dense  (in  sup  norm)  set  in  of 

functions  with  compact  support,  and  suppose  that  xN(*  )  is  an 
N-truncation  of  X(*)  (initial  condition  X(0)).  Let  q  ^ 

be  an  increasing  sequence  of  o- algebras  maasuruyj  xL,N(s),  s  <  t, 
and  define  the  operator  AL,N  (using  V’t'N  and  xC'N(-))  just 

i!  was  defined  (using  and  xL(*)).  Let  E^,N  denote 

pxDGCtation  conditioned  on  ^t,N 

— — - — - •  Suppose  that  for  each  N  and  each 

f{*)  e  there  is  a  sequence  f*'N(*)  c  ^/(AL'N)  such  that 

(3.1)  p-lim(fL*N(.)  -  f(xt,N(-) ,  •))  -  0 

e-*0 


Actually,  all  that  is  needed  is  lim  sup  P{ sup | x L ,N (t) |  >  K)  *  0 

K  ♦«  e  t<  T 

for  each  T  >  0,  but  boundedness  does  not  seem  to  be  a  restriction 
in  our  truncation  method. 
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(3.2) 


P-1  im  (At’Nf£*N(-)  -  (|-  ♦  AN)f(x£*NC),-)l  ‘  0, 
e-o  1 


Then,  if  (xL,J1(*))  is  tight  in  Pr(0,«)  for  each  N,  { x e  ( - )  > 
converges  weakly  to  X( • ) • 

Proof .  Fix  N  and  f(*)  t  _/ .  Choose  satisfying 

(3.1)-(3.2).  Equation  (3.3)  follows  from  the  definition  of  AL’^ 
( see  Kur t ;  [  3  ] ) 


(3.3) 


E^*V’N(fs)  -  f£,N(t)  -  Jl  SE£»NA£'NfC’N 


(u)du 


Let  <3  denote  an  arbitrary  integer,  let  t.  <  t,  i  <  q,  and  let 


l  - 


g(-)  be  a  bounded  continuous  function  on  Rrq.  Then  by  (3.1)-(3.3), 


lim  Eg(x£'N(t  ),  i  <  q)  (f(xC,N(fs),fs)  -  f(xe’N(t).t) 

f  *  A  1 


c  *0 


ft  +  S  ■>  LT  r  K 

'  j  (at  4  A  ’  (u),u)du)  -  0. 


Choose  and  fix  a  weakly  convergent  subsequence  of  {xL,!S,(')}t 
indexed  also  by  e  and  with  limit  denoted  by  X  (•)  and  where 
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N  N  £  + 

X  (0)  *  weak  limit  of  {x  '  (0)}.  Then,  by  the  weak  convergence 

N 

and  the  continuity  properties  of  the  coefficients  of  A  , 

(3.4)  Eg(XN(ti),  i  <  q)|Mj(t+s)  -  M^(t)]  =  0 
where 

(3.5)  M‘J(t)  -  f(XN(t),t)  -  f(XN(0),0)  *  j  (|y  ♦  AN)f  (XN(u)  ,u)du. 

By  the  density  of  and  the  arbitrariness+of  q,  (t^)  and  g(*) 

and  the  fact  that  for  each  T  <  ®,  P{sup|xN(t)|  <  -}«1,  m*  ( • )  is 

t<T  f 

a  martingale  for  each  N,  f(-,-)  c  $X'2.  Thus,  there  is  a  measure  P„ 

0  N 

on  Dr[0,®)  such  that  under  PN,  x(0)  has  the  distribution  of  XN(0) 
and  P  solves  (not  necessarily  uniquely)  the  martingale  problem 

(2.4)  with  AN  replacing  A.  Let  px^0)  den°t«  the  unique  measure 
solving  the  martingale  problem  (2.4)  (initial  condition  X(0)).  By 
the  uniqueness,  Pv/f..  and  P  must  agree  on  measurable  subsets 

of  the  set  {  x  ( * )  :  |  x  ( t )  |  <  N,  t  <  T)  e  Dr [0,®)  for  each  T  and 
M  >  N.  From  this  and  (2.3),  it  follows  that  xL(*)  ■*  X(-) 
weakly.  Q.E.D. 


*For  (3.4)  to  hold,  it  may  be  necessary  to  require  that  t^  /  TQ,  a 

null  set  which  might  depend  on  X*1  ( • )  ,  but  this  deletion  does  not 
affect  the  proof. 
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C  N 

Remarks.  One  useful  method  for  getting  the  f  '  (•)  from 
f(*,*)  is  developed  in  Section  5.  A  similar  method  was  used  in 
( 5 I ,  [10],  where  Kurtz's  original  method  [3]  was  used  to  prove 

weak  convergence,  and  no  N-truncations  were  required.  In  (5), 

£ 

{x  (•)}  was  defined  as  follows.  For  a  stationary  ^-mixing  process 
c  2 

y ( * ) ,  define  y  (t)  *  y(t/t  )  and  for  appropriate  F,G,  let 


(3.6a) 


xL  -  G(xL,t,yL)  +  F (xL ,t,yL)/fc. 


The  form  (3.6a)  uses  a  frequently  used  scaling  (see  ]6],  [7])  to 

get  limit  theorems  for  the  solutions  to  a  sequence  of  ordinary 
differential  equations  with  random  right  hand  sides.  Let 
qN ( • )  denote  a  real  valued  infinitely  differentiable  function  on 
Rr  which  equals  one  on  S^,  is  zero  on  Rr  -  SN+1,  and  is  bounded 
by  unity.  Then,  we  might  define  xt,N(*)  by 


(3.6b)  *C,N  -qM(xC*N) (G(x£*N,t,yC)  ♦  F(xe*N,t,ye)/e] 


The  sequence  [xL,N(*),  t  >  0)  is  obviously  tight  for  each  N, 
since  xt,N(*)  is  bounded.  Such  a  truncation  device  is  often 
useful  when  the  xL(*)  is  given  via  an  explicit  dynamical  equation. 
The  method  of  this  paper  avoids  the  need  for  the  density  of 

A 

(X-A)  3  in  ’,  for  some  A  >  0,  as  required  by  the  method  of  [3]. 


Essentially,  the  density  condition  is  replaced  by  the  uniqueness 
condition  for  the  solution  to  the  martingale  problem. 
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Tightness .  We  have  assumed,  in  Theorem  1,  that  for  each  N, 
xL'NC)  is  bounded  uniformly  in  e.  Thus  tightness  of  {xu'N(*)J 
is  equivalent  to  tightness  of  (f(xL'N(-))}  for  each  f(')  t  Cq, 
the  space  of  continuous  bounded  real  valued  functions  on  Rr  with 

compact  support.  We  prefer  to  deal  with  f(xL'N(’))  rather  than 
with  x  '  (•)  directly  because,  as  seen  below,  it  often  enables 
us  to  use  the  f  '  (•)  constructed  in  Theorem  1.  The  following 
special  case  of  Kurtz's  theorem  [3,  (4.20))  will  be  useful. 

Lemma  1 .  Let  { y  L  ( • ) )  be  a  bounded  sequence  of  processes  with 
paths  in  Dr[0,“).  Suppose  that  for  each  T  <  6  >  0,  there  is  a 

random  variable  Y ^  (  6 )  such  that* 

(3.7)  E^Yc(6)  -  Etmin  I1*  I  (t+u>  ~  yC(t)  |2)  ,  all  u  <  6,  t  <  T, 
and  that 

(3.8)  lim  Tim  Ey  (6)  ■  0. 

6-0  e-0 

Then  {yL(*))  is  tight  in  D^IO,®). 

For  our  problem,  y^')  *  f(xL,N(*))»  for  f(*)  c  CQ.  The 
following  useful  method  of  getting  the  YL(6)  is  suggested  by  a 
comment  in  [3),  and  is  an  extension  of  a  result  in  15). 

+e£  denotes  conditioning  on  yL(s),  s  <  t. 


Theorem  2.  Fix  N.  For  each  f(*)  in  a  dense  get  ^  c  cQ 
which  contains  the  square  of  each  function  in  it,  ,  let  there  be  a 
sequence  {  f L '  ^  ( • )  }  if}  ^  such  that  f  *  ^  ( * )  t  ^(A  )  »  >nd 

for  each  real  T  >  0  let  there  be  a  random  variable  M.'  (f)  such 
that 

(3.9)  P(  sup  1  f L  *N  ( t )  -  f(xL,N(t))  |  >  a}  ♦  0  as  e  ■*  0  for  each 

t<T 

a  >  0# 

(3.10)  sup|AL,N  fC,N(t)|  <  M^'N 

t<T  ~  T 

(3.11)  sup  P{  Ail  ’ N  ( f )  >  K)  -  0  as  K  -  «. 
c>0 

Then  {f(xL,N(-)))  is  tight  in  Dr  ( 0 ,  —  )  for  each  f(«)  t  Qf^ 

and  (  x  L ' “,  ( • )  )  is  also  tight  in  Dr  [(),•). 

Remark .  Normally,  we  let  be  the  subset  of  functions  in 

*3  which  do  not  depend  on  t,  and  the  f  '  (•)  would  be  constructed 
as  they  would  be  for  use  in  Theorem  1.  This  is  illustrated  in 
Section  5,  and  also  in  (5),  [101,  where  the  sequence  {ft(*)l  of 

perturbations  must  be  constructed  in  order  to  use  Kurtz's  original 
method.  Equations  (3.9) -(3.11)  require  more  of  the  f  '  (•)  than  the 
p-lim  requirements  of  Theorem  1.  But,  as  implied  above,  these 
additional  properties  often  exist.  In  the  example  of  Section  5 
Al’N  ft,N(t)  is  bounded  uniformly  in  t  and  the  sup  term  in  (3.9) 
goes  to  zero  uniformly  in  w  (w.p.l)  as  e  -*  0.  Of  course, 
tightness  of  (x  '  (•)>  might  be  a-priori  obvious,  as  in  the  case 
noted  in  the  remark  after  Theorem  1. 


proof .  In  order  to  simplify  the  notation,  drop  the  superscript 
N.  By  Lemma  1  and  (3.9),  for  each  f(*)  «•  and  T  <  -,  we  need 

only  find  Yl(6)  satisfying  (3.8)  and  such  that  for  u  ^  6,  t  <_  T, 

E^YC  ( 6)  >  min[l,E*|f  U^t+u))  -  f  (x  *•  (t) )  |  2 1  . 

Let  f(«)  t  9^  and  let  (fC(*))  (resp.,  {f^C*) ))  denote  the 

sequences  in  i?(XL)  associated  with  f(*)  (resp.,  with  f2(*))  by 
the  hypotheses.  Write  f(xL(t))  •  f  (t) .  Then 

|  f  (t+u )  -  f  ( t )  |  2  -  (fjU+u)  -  f^(t))  -  2f  (t)  (fC  (t+u)  -  fC  (t)  ) 

♦  (f2(t+u)  -  f2(t+u))  ♦  (fL2(t)  -  f 2  (t )  ) 

♦  2f  (t)  ( (fL  (t+u)  -  f (t+u) )  -  (fC (t)  -  f (t)  )  1  . 

t  t  2 

First  suppose  (Case  1):  sup  EM_(f)  ♦  sup  EMl(f  )  <  •  and 

e  t 

sup  (|fC(t)  -  f  (t )  |  ♦  |f2(t)  -  f 2  (t )  |  )  =  ol(T)  -  0  uniformly  in  <*» 
t<T 

as  e  -»  0.  Then  we  can  get  a  Y£(«)  of  the  desired  type  if  there  is 
function  7  ^(6)  satisfying  (3.8)  and  such  that  for  u  <  6,  t  <  T, 

EjT  t(6)  >  2 1  |  f  ( • )  |  |  |  E^f  C  (t+u)  -  f c  Ct)  |  +  I E  *f  2  (t+u)  -  f2(t)|. 

By  (3.3)  and  (3.10),  we  can  use 


7t(6)  *  6(21  |f  (•)  |  |M^(f)  ♦  M^.(f2)). 
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and  the  proof  of  Case  1  is  completed. 

Now,  let  the  condition  of  Case  1  not  hold  and  use  a  truncation 


argument.  For  each  6  >  0  define 


r<  ■  minlt:  |ALfL(s)  >  1/6  or  | A*" f ^ I  >  1/* 


jL (t)  >  6). 


Let  denote  a  stopping  time  (relative  to  {&^})  and  note  (3) 

that  (3.3)  holds  if  t  and  t  ♦  s,  resp. ,  are  replaced  by  t  n  ^ 
and  (t«-s)  fl  it,  resp.  Now  repeat  the  Case  1  argument  to  get  that 
{ f ( -  n  ) )  is  tight  for  each  6  >  0.  Equations  (3.9)  and  (3.11) 

2  t 

(for  both  f  and  f  )  imply  that  lim  lim  inf  P(T  >  T)  ■  1.  This 

6-*0  t-*0  $ 

and  the  tightness  of  ( f ( *  h  T  < )  )  for  each  6  >  0  completes  the 
proof . 


4.  Discrete  Parameter  Processes 


Only  a  sketch  will  be  given.  For  each  c  >  0,  let  {x^}  denote 
a  discrete  parameter  Rr  valued  process  and  let  ( ^  ^ )  denote  a 
sequence  of  positive  numbers  tending  to  zero.  Define  the  interpola¬ 
tion  xL(*)  by  xl(t)  *  xi  on  lit£,  if  t4‘tc)  •  Let  xL'**(*)  be 
defined  the  same  way  it  was  defined  in  Theorem  1.  For  f(*)  c 
def  ine  A  t,Nf  ( • )  by 


(4.1) 


AL,Nf(t)  -  [E^'Nf (t+ft)  -  f(t))/ffc. 
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Then 


(4.2)  E^'Nf((i+n)  t  £)  -  l  E^NAu,Nf  ( j  x  t )  i 


The  suggestion  for  adapting  the  continuous  parameter  results 
to  the  discrete  parameter  case  which  was  made  in  (3,  p.  625)  can  be 
hard  to  use,  because  with  that  approach  the  construction  of  the 
(f  (•)}  or  {f  '  (•)}  can  be  quite  messy.  It  often  seems  best 
to  deal  with  the  discrete  parameter  case  directly,  using  (4.2)  and 
the  martingale  method  of  Theorem  1.  In  many  cases,  our  method  seems 
to  simplify  the  problem  of  obtaining  appropriate  (ft(*))  or 
fft,N(*)}*  For  example,  for  the  discrete  parameter  analog  of  the 
problems  in  (5),  (12)  or  in  Section  5  below,  fLf)  (or  fL,N(*)) 
takes  the  form 

f C(- )  -  f(-)  ♦  fj(-)  +  f 2  ( } ' 

where  all  functions  are  constant  on  intervals  ( i 1 1 , i t i L )  and 

where  the  f^(*)  are  constructed  analogously  to  the  method  used  in  (5), 
(12),  or  as  in  Section  5  below,  but  where  the  appropr iate  series  are 
used  in  lieu  of  integrals.  A  simpler  treatment  of  the  discrete 
parameter  case  was  one  of  the  motivations  for  the  approach  of  this 
paper.  Examples  will  appear  in  a  subsequent  paper.  Using  the 
methods  of  [3),  (5),  reference  (10)  shows  how  to  get  the  fL(*)  from 
f(*)  for  one  type  of  discrete  parameter  case. 
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Theorem  J .  Assume  the  conditions  of  Theorem  1,  except  that  the 
f  1  ' N  ( • )  are  constant  on  the  intervals  ,  and  the 

"discrete  parameter"  A  '  of_  ( 4 . 1 )  is  used.  Then  the  conclusions 
of  Theorem  1  hold. 


The  proof  is  almost  identical  to  that  of  Theorem  1  and  is 
omitted.  We  need  only  work  with  (3.3)  with  t,s  being  integral 
multiples  of  it.  Then  (3.3)  reduces  to  (4.2). 

Theorem  2  can  be  used  to  prove  tightness  and  the  "sups"  are 
over  points  i*t  £  T*  i  ■  0,1,...»  only. 

5.  An  Application  to  Averaging 


In  order  to  illustrate  the  ease  of  use  of  Theorem  1,  we  redo 
the  classical  averaging  problem  of  Khazminskii  (11).  The  conditions 
are  not  the  best  and  can  easily  be  weakened.  Let  £(•)  be  a 
valued  stationary  bounded  right  continuous  i*-mixing  process  with 

.CD 

rate  satisfying  ♦  ^^(s)ds  <  Let  F(*,*,*)  be  a  continuous 

R r  valued  function  on  Rr4l*q,  whose  first  partial  x-derivativef 

are  continuous.  Define  F(x,t)  *  EF  (x  ,  t  ,4  (v  ) )  .  Let  q^t")  as 

below  (3.6a).  Define  £C(t)  -  4(t/e),  FN(x,t,£)  ■  F  (x ,  t )qNU )  . 
FN(x,t)  -  F(x,t)qs(x)  and  define  xc(-)  and  xt,N(-)  by 


(5.1a) 


F(x£,t,(.C) 
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its  mean  and  the  ^-mixing  condition, 

(5.2)  f^'N(t)  =  0(fc)  uniformly  in  t,w. 

The  gradient  f^'^(x,t)  also  exists,  and  the  differentiation  can 

i  #  X 

be  done  under  the  integral  sign.  Also,  f  ' N  (xL ' N  (t)  ,  t )  -  0(t) 
uniformly  in  t,w. 

Define  the  operator  AN  by  ANf(x,t)  *  F * (x , t) f  (x, t) .  Define 
the  function  fL'N(*)  by  fL'N(t)  *  f(xL,N(t),t)  ♦  fJ'N(t).  Then 
fL'N(*)  t  ^At,N)  and  (abbreviate  x  *  xL'N(t)) 

(5.3)  AC*Nfe*N(t)  -  f  t  ( x , t )  +  f;(x,t)FNlx,t,4L(t)) 

-  iFN(x,t ,4L(t))  -  fn c x . t ) j  ♦  (fi'x(x,t)) v**t.te(t)) 

“  (y£  ♦  AN)f(x,t)  ♦  0(t). 

Equations  (5.2)  and  (5.3)  and  the  uniqueness  of  x(*)  imply  the 
conditions  of  Theorem  1.  Q.E.D. 

Linearization  and  first  order  perturbations.  We  now  examine  the 
asymptotic  properties  of  the  centered  solution  xL(t)  -  x(*)  *  6xe(t), 
as  e  -►  0. 

Theorem  5 .  Assume  that  F  ( • , t , 4 )  has  two  continuous  in  ( x , t , 4 ) 

xC (0)  -  x 

partial  x-derivatives,  and  that  - -  •*  0  weakly. 

Then  under  the  conditions  of  Theorem  4, 


ut(*)  *  (x^O  -  x(*))/^t  converges  weakly  to  the  solution  of  the 
stochastic  differential  equation  (5.4) 


(5.4a)  du  -  Fx(x(t) , t)udt  ♦  dB,  u(0)  -  0, 

where  B(-)  is  a  Brownian  notion  with  covariance 

d-E  B(t)B'Ct)  . 

Jt 

(5.4b)  j  E (F (x ( t ) , t ,& ( 0) )  *  F(x(t).t))(F(x(t).t.4(s))  -  F (x ( t ) , t ) )  'ds 


Proof .  We  have 

(5  5)  0t(t)  a  F(xC(t),t,(l(t))  -  F(x(t)  )  +  6FL(t) 

/c  /c 

where  6Ft(t)  *  F  (x  (t)  ,  t  ,£ L  ( t)  -  P(xvt),t).  Althajgh  x(0)  *  xQ  might 

be  random  it  plays  no  important  role  in  the  proof  and  we  suppose 

for  convenience  that  it  is  constant  and  N  will  be  large  enough 
such  that  xQ  <  N.  We  need  only  work  on  an  arbitrary  finite 

interval  (0,T)  and  hence  can  suppose  that  |x(t)|  is  bounded 

in  t.  Also,  by  a  suitable  choice  of  probability  space,  we  can 

suppose  that  xL(0)  •*  xQ  w.p.l.  Thus,  since  x(*)  is  determinisitic 

and  xt(*)  -*  x(*)  weakly,  for  each  6  >  0  there  is  an  >  0 

such  that  P(sup|xL(t)  -  x(t) |  >  6)  <  6  for  c  <  t ..  in  order  to 

t<T 
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prove  (5.4)  we  can  w.l.o.g.,  suppose  that  for  an  arbitrarily  small 

but  fixed  6  >  0,  sup|xL(t)  -  x(t)|  <  6. 

t<T 

Using  these  simpl ications  and  expanding  (5.5)  yields 


uL  -  Fx(x,t,4t)uL  +  /e  (uL)  'Fxx  (t)ub  +  6FL(t)//c, 


where  Fxx  (t)  is  a  matrix  which  is  bounded  on  10, T] .  Define 
uL,N(‘)  by  uL,N(0)  ■  uL(0)  when  |uL(0)|  <  N  and  zero  otherwise, 


and  for  t  >  0  by 


(5.6)  ue,N  -  IFxa,t,;C)uC*N  ♦  /^(ue'N)'^xx(t)ufc'N]qN(uC*N) 


♦  6FL(t)//e. 


If  Bc(t)  -  f  6F  (s)ds//e  is  tight  on  D  (0,-)  with 

J  0 

all  weak  limits  being  the  same  Wiener  processes,  then  we  note 

£  N 

merely  in  passing, that  (5.6)  implies  that  (u  ’  (•))  is  tight 
for  each  N  and  that  all  weak  limits  have  continuous  paths. 
Proceeding,  let  f(-,-)  c  *  Then 


Ae’Nf(ue’N(t),t)  -  ft(ut,N(t) ,t)  ♦  f^(u' 


The  component 


/e  f^(uC,N(t) ,t)qN(uC'N(t)) [(uC'N(t)) *Fxx  (t)ue'N(t) ) 


of  the  second  term  above  always  has  p  lim  *  0.  We  must  find  a 

£-0 
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sequence  {fL'N(*)J  satisfying  the  conditions  of  Theorem  1.  We 
look  for  fe'Nf)  of  the  form 

fC'N(t)  *  f  (uC'Nlt)  ,t)  +  l  fC'N(t) 

i-1  1 

where  the  f^'N(t)  have  the  form  f ^ 'N (ut,N (t) , t)  and  the 
f (U,t)  are  now  to  be  defined. 

OD 

(5.7)  f^,N(u,t)  -  E  ^  * N  f  *  ( vi ,  t  ♦  s )  l  ( F  (x  ( t  ♦  s )  ,t^s,4e(t>s)) 

0  x 

*  •  t*s))qN(u)  *  — — (|"S)']ds 

n  /c 

•  ao 

-  e  J  Et,Nfu(u't4CsHFx(*(t*ts)»t*cs,4(|  ♦$)) 

-  Fx(x(t*cs) ,t»ts))qN(u)  ♦  6FL(t^ts)//cJds. 

By  the  compact  support  of  f <•,•),  the  centering  of  the  integrand 
about  its  expectation  and  the  ^-mixing,  p-lim  ff,N(t)  =  0.  Also, 
fj  (*#t)  is  differentiable,  and  we  can  take  the  derivative  under 
the  integral  sign.  Using  this,  we  readily  show  that  f^'N(*)  fc 
5>(A£',N)  and  (abbreviate  u  *  ut,N(t)) 

At,Nfj,N(t)  -  -r (u,t)  l(Fx(x(t) ,t,tL(t))  -  Fx(x(t) ,t))qN(u) 

♦  6FC(t)//£)  ♦  fJ;JJ(u,t)  •ut,N(t). 


(5.8) 
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The  only  component  of  the  last  term  of  (5.8)  which  does  not  have 

p-lim  =0  is  that  arising  from  the  6FL//t  part  of  ut,N  and 
e^O 

is  (u  '  still  written  as  u) 


.  oo 

i  ' ^E^'N6Fl ’ (t+s) fuu(u,t+s) 6FC(t)ds 


.e  *  N  r_t  * 


Et'  6F  (t+ts)fuu(u,t+ts)6F  (t)ds  =  Q  '  f  (u,t) 


which  is  bounded  uniformly  in  ui,t  by  the  ^-mixing.  Define  the 
operator  AQ  by  EQt,Nf(u,t)  =  Agf(u,t).  By  the  stationarity  of 
£(*)  and  the  ^-mixing  we  have  the  following  convergence  uniformly 
in  u,t,  where  6F(t,s)  =  F (x (t) , t , C (t+s) )  -  F(x(t),t)  and  the 
integral  defines  the  operator  A.: 


A  f(u,t)  -  E6F’(t,s))  f  (u,t)6F  (t,0)ds  -  A„f(u,t) 


Next,  in  order  to  "average  out"  the  Q  ,N  term,  we  introduce 
f2#N(u,t)  in  the  form 


»<X>  a  OD 

f  2  (u  ,  t )  -  j  dsE^,N{|  j  dv  (t*s*v)fuu(u,t*s*v)6FC(t*s) 


-  A|jf(u,fs)} 


By  the  change  of  variables  s/e  s,  v/e  ■*  v,  and  the  ^-mixing 
condition,  it  is  readily  seen  that  fj’^Cu.t)  *  0(e)  uniformly  in 
u»,t,u.  Also,  f 2*^(t)  -  f2,N(uE,N(t) ,t)  c  ^(AC,N)  and 
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At,Nf*,N(t) 


-Qfc'Nf(ut'N(t),t)  *  Aof(uL’N(t),t)  ♦  0(/t), 


where  0(/t)  is  uniform  in  w,t. 
Adding  all  the  above  yields 


(5.9) 


p-lim(fC'N(*)  -  f(u'’*"(-).'))  «  0 
e-0 


c.N 


i  - 1  im  [Ae,Nf  t,N(  ’ )  *  {ft(uC<Y).-) 


e-0 


(5.10)  ♦  f^(uC*N(-).-)qN(u€,N(-))Fx(x(-).-)ut,1N(-) 


c.N,. 


♦  A0f(uC,N(-) ,*)}]  “  0. 


If  {ue,N(-)>  or  (BY))  were  tight,  then  via  Theorem  1  any 
weak  limit  of  {ut'N(’))  would  be  a  diffusion  with  operator  aN 
where 

(5.11)  ANQu,t)  *  f^(u,t)qN(u)Fx(x(t)  ,t)u  ♦  AQf(u,t) 


and  Theorem  1  would  give  us  the  limit  (5.4).  Then,  to  get  the  covariance 
form  given  in  (5.4b)  from  (5.11),  we  just  symmatrize  the  co¬ 
efficients  in  Aq:  Write  Agf(u,t)  ■  ^  I  #ij^^u  u  ^u,t^  '  where 

i  *  j  A  J 

we  take  the  symmetric  form  for  fa^t-)}  -  (i.e.,  ai;j(*)  *  aji(-)). 

Then  note  that 

(E6F(t,s) 6F‘ (t,0) )  1  -  E6F(t,-s) 6F’ (t,0) . 


and  use  AQf(u,t)  -  y  trace  E(t)fuulu,t)  and  ^-EB(t)B'(t)  -  E(t). 


The  tightness  of  {ul,N(*))  could  be  proved  either  by  proving  it 
directly  for  {u‘',N(-))»  or  for  the  { B  ^  ( • )  }  and  then  using  the 
remark  below  (5.6).  To  prove  the  weak  convergence  of  (  BL(*))  to  the 
desired  Wiener  process,  we  could  work  with  BL'N(*)  defined  by 
B'N  =  6f  ^  (t )  qN  (B*- ' N) //e ,  BL,N(0)  *  0,  and  proceed  exactly  as  for  the 
(ul'N(-) )  above  to  show  the  desired  convergence.  An  alternative 
method  for  both  tightness  and  weak  convergence  follow  the  lines  of  the 
proceeding  development  but  with  the  6fL  in  (5.6)  multiplied  by 
qN(uL,N)  also.  Then  (uL'N(-))  is  bounded  for  each  N  and 
tightness  can  be  proved  by  the  method  of  Theorem  2.  The  details  are  thi 
same  as  for  the  given  development,  except  that  a  term  proportional  to 
qM  „(u)  appears  in  the  expression  for  the  weak  limit  of 
(u-V),  c  >  0).  But  this  term  is  zero  for  |u|  <  N  and  disappears 
when  N  -*  00 .  For  this  method  the  f^  satisfy  the  same  bounds 
(0(/t))  as  they  do  in  the  given  development  and  the  M^,'N(f) 
bounded.  Thus,  Theorem  2  implies  tightness.  We  omit  the 


are 
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DIFFUSION  APPROXIMATIONS  TO  OUTPUT  PROCESSES  OF  NON-LINEAR  SYSTEMS 
WITH  WIDE  BAND  INPUTS,  AND  APPLICATIONS 

•  *  * 

Harold  J.  Kushner  ' 

Abstract 

Many  problems  in  communication  theory  involve  approximations 
of  a  Markov  type  to  outputs  of  non-linear  (feedback  or  not)  systems, 
often  so  that  Fckker-Planck  techniques  can  be  used.  A  general 
and  powerful  method  19  presented  for  getting  diffusion  approxima¬ 
tions  to  outputs  of  9y9tems  with  wide  band  inputs.  The  input  is 
parameterized  by  c  and  as  e  ■*  0  the  band  width  goes  to  •  (loosely 
speaking).  It  is  proved,  under  reasonable  conditions  on  the 
systems  and  noise,  that  ths  sequence  of  system  output  processes 
converges  weakly  to  a  Markov  diffusion  process,  which  is  charac¬ 
terized  completely.  Many  communication  systems  fit  the  model  of 
the  paper  and,  in  order  to  make  mathematical  sense  out  of  many 
common  developments  of  system  properties,  assumptions  such  as 
those  of  this  paper  are  often  required.  The  usefulness  and 
relative  ease  of  use  of  the  method  is  illustrated  by  application 
to  three  examples:  (a)  phase  locked  loop,  where  a  Markov  diffu- 
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■ion  approximation  of  the  error  process  is  developed,  (b)  adaptive 
antenna  system,  where  an  asymptotic  analysis  of  the  equations  for 
the  system  is  given,  (c)  diffusion  approximation  to  the  output  of 
a  hard  limiter  followed  by  a  band  pass  filter;  input-output  S/N 
ratios  are  developed  (a  version  of  a  classical  problem  of  Daven¬ 
port)  .  Difficulties  with  the  usual  heuristic  approaches  to  (a) , 

(b)  are  discussed.  The  method  is  versatile  and  the  models  quite 
ganeral .  Since  weak  convergence  methods  are  used,  the  approximate 
"limits"  yield  approximations  to  many  types  of  functionals  of  the 
actual  systems. 
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1.  Introduction 

Many  problems  in  communication  theory  involve  representations 
of  (or  approximations  to)  outputs  of  devices  (linear,  nonlinear, 
feedback)  whose  inputs  are  signals  added  to  a  relatively  wide 
band  noise;  e.g.,  phase  locked  loops  (PLL) ,  adaptive  antenna 
arrays  or  automatic  gain  controls.  Normally,  the  development 
of  the  output  representation  (or  approximation)  requires  special 
assumptions  (e.g.,  sinusoidal  inputs,  Gaussian  noise),  and 
various  heuristic  arguments  are  usually  needed  to  approximate 
the  output  by  Markov  diffusion  processes  whose  Fokker-Planck 
equation  is  to  be  analyzed  in  order  to  get  some  sort  of  approx¬ 
imation  to  the  statistics  of  the  true  output  process. 

In  this  paper,  a  rather  powerful  method  is  presented  for 
getting  either  the  usual  or  related  approximations,  under 
assumptions  which  are  reasonable,  explicit,  and  often  weaker 
than  the  usual  ones.  The  relative  ease  of  use  of  the  method 
is  illustrated  here  by  applications  to  three  rather  different 
problems:  (a)  the  PLL,  (b)  an  adaptive  antenna  array,  (c)  a 
version  of  Davenport's  {  j  )  result  on  the  output  of  a  band 
limiter  followed  by  a  zonal  filter. 

In  particular,  denote  the  input  noise  by  nt(-)  where  as  e  ♦ 

0  the  bandwidth  (BW)  •*  ®.  Under  conditions  to  be  imposed,  the  se¬ 
quence  of  outputs  (with  input  signal  s(-)  plus  noise  nc ( • ) )  will  con 
verge  to  a  process  whose  state  variable  representation  is  a 
Markov  diffusion  process,  and  we  will  readily  be  able  to  find 
that  process.  Implicitly  or  explicitly  (as  we  indicate  in  the 
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examples  below)  many  of  the  current  heuristic  arguments  use 
a  similar  assumption  on  the  noise  -  at  least,  the  "output 
approximation"  may  not  make  sense  unless  it  is  viewed  as  the 
limit  of  a  sequence  of  outputs  in  our  sense. 

Our  method  has  the  advantage  that  the  assumptions  are 
clearly  seen,  it  is  applicable  to  a  great  variety  of  situations, 
and  the  terms  which  a  more  heuristic  analysis  would  drop  can  be 
clearly  seen.  The  limit  is  in  the  sense  of  weak  convergence  of 
probability  measures  (2).  Thus  the  distributions  of  a  great 
variety  of  functionals  of  the  sequence  of  outputs  (with  para¬ 
meter  c)  converge  to  that  of  the  limit.  Furthermore,  under  cer¬ 
tain  circumstances  additional  information  on  approximations  to 
stationary  measures  can  be  obtained.  Also,  nonstationary  inputs 
can  be  accommodated.  The  fact  that  y(*)  can  occur  nonlinearly 
in  (2.1)  -  (2.3)  below  is  important  in  the  applications  which 
involve  some  nonlinear  processing. 

In  the  communications  literature,  the  problem  of  obtaining 
the  (Markov-di f fusion )  limit  of  the  sequence  of  outputs  of  a 
system  as  the  input  noise  BW  tends  to  «•  was  perhaps  initiated  by 
Wong  and  Zakai  (3],  I  4]  in  a  very  special  case.  Later  cases 
were  treated  by  Khazminskii  (5),  Papanicolaou  and  Kohler  [6], 
Papanicolaou  and  Blankenship  (  7 ]  and  Kushner  (81,  (  9  ) .  The 
treatment  here,  based  on  a  semigroup  approximation  of  Kurtz  |  10]  » 
was  developed  in  [  8 ) ,  f  9 ]  to  get  limit  theorems  of  the  desired 


type. 


V 


In  Section  2,  the  basic  model  is  discussed,  together  with 
the  general  scheme  of  Kurtz  (10],  and  the  main  approximation 
theorems  appear  in  Section  3.  Sections  4,  5  and  6  deal  with  the 
three  problem  classes  (a),  (b)  and  (c)  mentioned  above.  The 
theory  is  developed  first  for  the  canonical  models  (2.1)-(2.3). 
Often  in  applications,  such  as  those  in  Sections  4-6,  the  models 
are  a  little  different.  But,  as  we  will  see,  the  development 
give  for  the  canonical  model  tells  us  exactly  how  to  proceed  in 
the  other  cases.  In  a  project  currently  under  way,  the  method 
is  used  to  study  a  class  of  PLL's  with  non-linear  filters  (which 
seems  to  have  certain  advantages),  a  problem  which  has  not  been 
treated  and  for  which  there  seems  to  be  no  other  "natural”  method 
at  present. 


2.  The  Basic  Model 

Noise  model.  First,  we  derive  the  noise  model.  In  order 
to  conveniently  get  a  class  of  processes  nL(*)  whose  BW  goes  to  ® 
and  energy/unit  BW  converges  to  a  constant  ^  0  as  e  -*  0,  we 
work  with  n  (•)  of  the  form  nc  (t)  ■  yc(t)/e,  where  y'  (t)  -  y(t/e^) 
and  y  is  a  stationary  process.  Other  forms  are  possible.  In 
particular,  see  (  9],  where  n'  (*)  is  built  up  from  a  sequence 
of  small  correlated  effects,  each  of  whose  "size"  -»  0  and  the 
number  of  which  (in  any  finite  interval)  goes  to  »  as  e  -*  0. 

Other  forms  are  possible  -  and  yield  rather  similar  results. 

One  way  or  another,  an  explicit  model  for  nl (•)  must  be  given 


which  allows  BW 


®  as  e 


0.  The  selected  model  is  one  useful 
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6. 


choice.  As  seen  below,  it  has  the  desired  properties.  The 
method  to  be  developed  can  handle  many  other  useful  noise  models 
as  well.  Basically,  the  noise  must  be  parametrized  in  such  a  way 
that  Theorem  1  can  be  adapted  to  the  problem. 

Suppose  that  y(*)  has  a  spectral  distribution  S(w).  Then 

that  of  n'  (•)  is  SU^w)  =  S  (w)  .  See  Fig.  1.  Note  that  the 
2 

t/e  scaling  spreads  the  BW  (and  the  "center"  frequency  in  the 
band  if  there  is  one  (as  in  Fig.  lb))  and  the  1/t  scale  keeps 
the  energy  per  unit  BW  from  degenerating.  Without  the  1/t 
factor  the  energy  per  unit  BW  goes  to  zero  as  t  *  0,  and  all 
the  limits  are  "noiseless".  We  do  not  require  that  y(-)  has  a 
spectral  distribution.  The  above  remark  is  for  motivation  only. 

System  models.  There  are  several  canonical  forms  with  which 
we  can  work.  The  system  outputs  can  be  representable  (state 
variable  form)  by  one  of  the  related  ODLs  (ordinary  differential 
equations) 

(2.1)  -  G(xc,ySt)  ♦  F (xc ,y'  ,  t)/c, 

(2.2)  k'  -  Gc(x’  ,yf',t)  +  F  (x‘  ,ye,t)/c, 

(2.3)  x‘  -  G  (x*  , t )  ♦  F(  (x’  ,yc,t)/e,  x’  (0)  €  Rr  , 

Euclidean  r-space. 
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In  order  to  simplify  the  development,  we  deal  mostly  with 
(2.1),  and  then  show  how  to  extend  the  result.  As  will  be  clearly 
sean  in  Sections  4-6,  the  forms  (2.1) -(2. 3)  cover  many  common 
cases,  and  the  method  to  be  described  can  readily  be  extended  to 
many  other  cases.  The  t  argument  accounts  for  the  presence  of  the 
signal  or  other  non-stationarities  (see  Sections  4-6). 

Weak  convergence.  Let  Dr(0,®)  be  the  space  of  Rr-valued 
functions  on  (0,®)  which  are  right  continuous,  have  left-hand 
limits  and  let  the  space  have  the  Skorokhod  topology  (l  2  )* 

Section  14),  as  is  usual  in  studies  of  weak  convergence.  Each 
process  x' ( • )  has  paths  in  Dr[0,®).  In  fact  the  paths  are  con¬ 
tinuous.  Let  x(-)  be  a  process  whose  paths  are  also  in  Dr(0,®). 

We  say  that  xc ( • )  5  x(-)  (weak*  convergence  of  the  corresponding 
measures)  iff  for  each  bounded  continuous  real-valued  f(*)  on 
Dr[0,®),  Ef (xc  ( • )  )  ♦  Ef(x(*)).  This  is  a  considerable  general¬ 
ization  of  convergence  in  distribution,  and  is  a  concept  which 
is  very  useful  in  many  areas  of  probability  and  statistics.  In 
addition,  if  xc(*)  ■*  x(*)  weakly  in  Dr(0,®)  and  f(.)  is  bounded 
measurable  and  only  almost  everywhere  continuous  with  respect  to 
x(*)  measure,  then  Ef(x'  (*))  ♦  Ef(x(*))  as  c  ♦  0  also. 

A  sequence  (x1  (•))  is  tight  iff,  for  each  6  >  0,  there  is 
a  compact  set  K,  in  Dr[0,®)  such  that  sup  P(x  (•)  €  K^)  _<  6. 

Suppose  that  x(*)  has  continuous  paths  w.p.  1.  Then  the 
two  usual  steps  in  proving  weak  convergence  are:  (i)  showing 
convergence  of  finite-dimensional  distribution  of  (x  (•)}  to 


those  of  x(>);  (ii)  showing  tightness  of  (x  (•)).  These  imply 
the  weak  convergence.  The  theorems  given  below  do  all  this 
efficiently.  Our  limit  process  x(*)  will  be  a  Markov  diffusion, 
and  the  theorems  below  allow  us  to  calculate  its  infinitesimal 
operator.  For  more  detail  on  weak  convergence  see  (2). 

A. 

An  example  of  the  limit  operator.  Let  "£ ^  denote  the 
set  of  bounded  continuous  functions  on  (0,®)  *  Rr  with  compact 

A  g 

support,  't, ‘o'"  the  subset  with  continuous  a-partial  t-deri- 

A 

vatives  and  fci-partial  x-derivatives ,  and  let  jtf’  denote  the 

f\ 

closure  of  fc*.  under  uniform  convergence.  Under  conditions  to 
be  imposed  (including  EF(x,y(s),t)  -  0  for  all  x,  t) ,  and  with 
the  model  (2.1),  the  infinitesimal  operator  (3/3t  +  A)  of  the 
limit  process  x(*)  is  (acting  on 


(2.4)  (d/dt  ♦  A)  f  (x,  t)  *  ft(x,t)  +  Ef^(x,t)G(x,y (0) ,t) 


EF'  (x,y  (0)  ,  t)  ( F  *  (x  ,y  (s  )  ,  t)  f  (x,  t)  )  ds 


l  b .  (x , t ) f  ( x , t )  +5  l  a  .(x,t)f  (x,t), 

t  1  X:  2  i,j  1J  *i*j 


where  b(*,»)  and  a(*,*)  ■  ^aij (*»*))  are  defined  in  the  obvious 
manner,  and  we  assume  that  a(-,*)  is  symmetrized  to  conform  with 
the  usual  form  of  the  operator  A.  Define  G(x,t)  -  EG (x ,y (0 ) , t ) . 


If  there  is  a  matrix  a(*,*)  such  that  a  ( • , - ) 


o (’,•) o (•,•)' /2 ,  then  there  is  a  standard  vector-valued  Wiener 
process  B(-)  such  that  x(*)  has  the  It6  equation  representation 

(2.5)  dx  *  b(x,t)dt  ♦  a (x, t )dB. 

This  is  the  case  in  the  examples.  Note  that  b(x,t)  contains  two 

components  -  the  first  is  G(x,t)  and  the  second  is 

/EF  *  (x  ,y  (s)  ,  t)  F  (x , y  (0 )  ,  t )  da  ,  where  we  define 
0  x 


The  last  term  arises  for  the  same  reasons  that  cause  the  Wong- 
Zakai  [3],  [4]  correction  term;  i.e.  the  interaction  between 

xL  (t)  and  nc(t).  As  it  turns  out,  the  typical  heuristic  argu¬ 
ments  used  to  deal  with  problems  (a),  (b)  obtain  "limits" 

without  the  "correction  term". 

e 

Some  definitions.  Let  Efc  denote  expectation  conditioned 
on  n*  (s),  s  <_  t.  If  k*  (•)  is  an  w,t  function  such  that  for 
each  T  <  ®  and  for  (u>,t)€  (1  *  10, T],  it  is  measurable  on  the 
product  o-algebra  *  4  [0  ,T]  *  £MnC  (s)  ,  s^T)  ,  we  9ay 
that  k*  €  y  ,  the  class  of  progressively  measurable  functions. 


* &[ 0,T]  is  the  Bore  1  algebra  over  (0,T). 
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Let  k  and  k  be  in  y'  .  We  say  that  p-lim  kl  *  k*  iff 
n  1  r  n  n 

sup  sup^E'k'  (t)  I  <  ®  and  E|k^  (t)-kL  (t)  I  •*  0  as  n  -*  «»  for  each 
n  t  n  n 

£ 

t.  Let  y  C  y  be  the  subclass  of  functions  k  such  that 

^  C  A  e 

sup^E  k(t) !  <  ®.  Let  y  q  denote  the  subset  of  y  of  p-right 
continuous  functions;  k  being  p-right  continuous  means  that 

A 

k€  y  and  for  each  t,  E  j  k  (t+s)  -k  (t)  |  ■*  0  as  s  +  0.  If  (some 
version  of)  p- limg^Q [ E^k '  ( t+s ) -k ’  (t)J/s  exists  in  y  ^ ,  it  is 
called  A  k  and  we  say  that  kL  G  ,  the  domain  of  the 

operator  A  ,  an  operator  which  is  analogous  to  the  weak  infin¬ 
itesimal  operator  of  a  Markov  semigroup.  If  k’  G  y  c,  we  say 
that  p-lim  kL  -  0  if  sup  E|kc(t)|  <  ®  and  E|k'  (t) |  *  0  as 

G  #  u 

c  ■*  0  for  each  t.  The  functions  introduced  in  Theorem  2  and  it 

its  proof  have  progressively  measurable  versions. 

Kurtz's  semigroup  approximation  theorem  (IQ  1  ,  adapted  to 
our  purposes.  We  treat  t  as  a  component  of  the  state  vector, 
in  order  to  allow  us  to  work  with  nonstationary  cases.  The 
conditions  will  be  commented  on  below.  They  are  more  readily 
verifiable  than  may  be  apparent.  The  following  theorem  (3)  is 
the  basis  of  our  method. 

Theorem  1.  Let  Z  (•)  *  (x‘  (- ) ,y*  (•))  be  a  sequence  of  Rr*r  - 
valued  right  continuous  processes,  x ( • )  a  (Rr-valued )  Markov 

A  A 

process  with  semigroup  T(*)  mapping  56*  into  and  which  is 


-37- 


strongly  continuous  on  (sap  norm) .  For  some  X  >  0  and  dense 

A  Ai  n 

set  9  C  &  (which  will  be  % „ '  ) ,  let  Ranqe ( X-A-3 /d t !  )  be 

-  -  0  - rj  — 

dense  in  ,  where  A+3/J  t  is  the  infinitesimal  operator  of  the 
process  ( x  ( t )  ,  t )  .  Suppose  that,  for  each  f  (E  J ,  there  is  a 
sequence  { f ' }  of  progressively  measurable  functions  such  that 
fC  €  £2(  A  '  )  and 


p-lim  (  f  L  ( •  )  -f  (x*-  (• )  )  ]  -  0 


p-lim{  A  Lfc  (•  )-  (At- 3/3t)  f  (xe  (•),•)  ]  -  0. 

Then  if  x '  ( 0 )  -*  x  ( 0 )  in  distribution,  the  finite-dimensional 
d istributions  of  x 1 ( • )  converge  to  those  of  x ( • )  ( with  initial 

condition  x(0))  as  c  ■*  0. 


Remark .  There  is  a  similar  theorem  for  tightness  of  {x*  (•)} 
which  is  particularly  useful  for  the  types  of  problems  encoun¬ 
tered  here.  In  fact,  if  the  finite-dimensional  distributions 
converge  (Theorem  1),  a  proof  of  tightness  under  reasonable 
conditions  is  not  hard.  See  (8,  Theorem  2,  Part  4] 
for  a  method. 


In  (2.4)  the  operator  (A+d/dt)  was  defined  on  a  set 

^1  3 

& q  .  In  Theorem  1,  (A+d/dt)  is  considered  on  a  dense  subset 

A  a  i  3 

€  V  (which  is).  The  question  of  concern  is:  does 

this  restriction  of  (A+3/3 1)  define  the  infinitesimal  operator 

A 

of  a  Markov  semigroup  uniquely?  If  the  closure  in  of  this 

restricted  operator  is  the  infinitesimal  operator  of  x(*)»  then 


A+3/Ot  defines  TC)  uniquely.  Since  we  can  only  work  (in  the 
proofs)  with  nice  classes  such  as  £/  -  and  not  with 

the  domain  of  the  infinitesimal  operator  of  x(*),  it  is  important 
to  know  if  is  big  enough  to  yield  the  limit  uniquely.  In 

fact,  the  condition  on  the  density  of  the  range  holds  in  all  the 
cases  of  Sections  4-6,  and  is  rather  unrestrictive . 

Due  to  lack  of  space,  it  is  not  possible  to  discuss  the 
relative  advantages  of  Theorem  1  (or  Theorems  2  and  3)  to  the 
approximation  problem  over  more  classical  semigroup  approxima¬ 
tion  methods.  It  seems  to  be  much  easier  to  use  in  the  usual 
problems  encountered  in  control  and  communication  theory 
And  the  relevant  proofs  (e.g.  that  of  Theorem  2)  are  shorter 
and  use  better  conditions. 

The  "density”  condition  together  with  the  condition  on 
strong  continuity  of  T(-)  can  be  eliminated  by  an  alternative 
approach  (11]  which  replaces  them  by  the  simple  assumption  that 
to  the  coefficients  a(*,-)  and  b(-)  of  A  there  corresponds  a 
stochastic  differential  equation  with  a  unique  solution  ( in  the 
sense  of  distributions) .  This  condition  also  holds  in  our 
examples.  The  proof  of  the  theorem  corresponding  to  Theorem  2 
in  that  case  would  be  almost  the  same.  We  stick  to  am  approach 
based  on  Theorem  1  because  it  is  also  applicable  and  the  refer¬ 
ences  are  currently  available. 

We  next  give  some  specializations  of  the  theorem  suitable 
for  our  applications.  Theorem  1  is  given  in  the  general  form 
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becsuse  it  shows  how  to  modify  the  following  specializations  when 
variants  are  required  for  particular  cases.  We  omit  explicit 
discussion  of  tightness,  due  to  lack  of  space.  Our  conditions 
will  guarantee  the  tightness,  via  the  methods  of  proof  of  [8]. 

•/  4 

'  V  V  I  )' 

3.  The  Main  Convergence  and  Approximation  Theorems 

We  start  with  the  form  (2.1)  and  bounded  y(*),  because  it 
is  good  enough  for  many  applications  and  illustrates  the  tech¬ 
nique  with  the  least  notational  encumbrance.  Then  we  discuss 
the  case  where  y ( * )  is  unbounded  and  F(x,y,t)/t  »  F(x,t)y/c. 
Finally,  we  remark  on  the  cases  (2 . 2 ) - (2 . 3),  which  actually  occur 
in  some  of  the  examples. 

Assumptions 

( Al )  F (*,*,• )  and  G (*,*,* )  are  continuous,  the  first 

(second,  resp.)  having  two  (one,  reap.)  continuous 
(in  x,  y,  t)  partial  x - derivatives . 

(A2)  | F (x, y , t ) |  ♦  |G(x.y,t)|  <M(l*|x|),  for  some  constant 

M. 

(A3)  y(*)  is  stationary,  bounded,  right-continuous, 

EF(x,y(s),t)  -  0,  each  x,  t,  and  y ( • )  is  strong 

mixing  in  the  sense  that  there  is  a  function  o  ( * ) 

•  1/2 

satisfying  fp  (s)ds  <  <*>  and 
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A 


sup  ;P(B  A)“P(B)  |  ^p(t), 

A,  B,  s 

A  €  (y  (u)  ,  u^s )  ,  B  <4  (y  (u)  ,u>s*t) . 

(Such  a  condition  is  quite  common  in  the  literature 
on  applications  of  weak  convergence  theory.  It  is 
satisfied  by  truncated  Gaussian  processes  with 
finite  BW  and  continuous  spectrum,  by  bounded 
erqodic  Markov  chains,  etc.) 

(A4)  The  operator  A+S/3t  is  the  restriction  to  & Q  of 

the  infinitesimal  operator  of  a  strong  Markov  process 

A  A 

with  semigroup  T(*)  mapping  into  and  being 

A 

strongly  continuous  on  ^ . 

A 

( AS )  A*? /at  on  its  domain  in  fc*  is  determined  by 

*  i  3 

its  action  on  fc* q'  . 

Remark.  (A4)-(A5)  hold  in  our  cases  and  in  the  usual  situations 
which  arise  in  communication  theory.  They  pertain  only  to 
the  limit  x(*),  and  not  to  the  xe  ( • )  .  Further  remarks  appear 
in  (8].  See  also  the  comments  at  the  end  of  the  last  section 
concerning  simplifying  the  conditions. 

Theorem  2.  Let  x  (0)  -*  x(0)  in  distribution.  Then,  under 
(A l ) — (AS ) ,  (x*  (•))  converges  weakly  in  Dr(0,»)  ^o  x(*),  _a 

diffusion  whose  infinitesimal  operator  O/at+A)  is  given  by 


(2.4) ,  and  vith  initial  condition  x ( 0 ) . 
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Outline  of  proof.  The  proof  lor  the  non- time-dependent 

case  appears  in  [  g  ) ,  and  is  a  direct  application  of  Theorem  1 

(for  convergence  of  finite-dimensional  distributions),  and 

•  *1  3 

another  result  of  ( 10 ]  for  tightness.  Given  f  €  ,  the 

main  object  is  to  get  the  sequence  (f  (•)  )  of  Theorem  1,  and  to 
verify  the  p-lim  requirement  of  that  theorem.  We  outline  this 

because  various  extensions  of  the  method  are  needed  for  the 
examples,  and  it  is  useful  to  have  an  explicit  outline  for  the 
time-dependent  case,  since  tho  forms  of  some  of  the  functions  are 
a  little  different.  Reference  [8]  dealt  only  with  G,  F  not 
depending  on  time,  but  tho  time-dependent  case  is  important  for 
applications  and  requires  only  a  few  changes  from  the  treatment 
in  [8].  The  method  of  getting  the  f  (•)  is  adapted  from  the 
averaging  method  in  (7) . 

Let  f  €  V  Q'  .  We  construct  f’  in  the  form  f (t)  - 
f  (x*  (t) ,t)  ♦  / i_0f i <x  <t) ,t) .  Our  f ‘  is  the  cf  j  of  (  8  ] ,  our 
f o  ls  split  off  from  the  t2f^  term  in  (  8  )  and  our  f'2  is  the 
remainder  of  the  t-2f^  term  there.  The  terms  are  modified  to 
account  for  the  explicit  t-dependence  of  f,  G,  F.  Note  that 
(use  x  »  x  (t)  for  notational  simplicity) 

(3.1)  A  f(x,t)  -  f t ( x , t )  ♦  f^(x,t) [G(x,yc(t),t)  ♦  F(x,yt(t),t)/e). 
Define  f^x.t)  by 


J 


1 

a 
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fjjlx.t)  -  |  (x,t^s)  |G(x,y‘  (t+s)  ,t*s)-G(x,t*-s)  ]ds 

0 

ao 

•  C2  |  E^.f^(x,t+e2s)  lG(x,y  (^y+s)  .t+t2s)-G(x.t+c2s)  )ds. 

0 

The  integral  exists  for  each  e  >  0  by  (Al),  (A3)  and  the  compact 

support  of  f.  Also  (t)  =  £q(xc  (t)  ,t)  ■  0(c2)  and  fg(*)  G  Li*(A  ). 
Again  use  x  ■  xL  (t) .  Then 

(3.2)  ALf!,(x,t)  -  -f *  (x,t)G(x,yl  (t) ,t)  ♦  f  •  (x,  t)G  (x,  t)  ♦  0(c)  term* 

Note  that  a’  (f  (x,  t)  +f  q  (x,  t)  ]  -  ft(x,t)  ♦  f  (x,  t)G  (x,  t)  ♦ 

0(e)  terms  ♦  f’(x,t)F(x,yc (t) ,t)/c.  The  term  G(x,yL  (t) ,t)  in 
(3.1)  has  thus  been  replaced  by  its  average  G(x,t)  modulo  an 
0(e)  term.  This  was  the  reason  for  the  addition  of  the  f^  term. 

A  similar  "averaging"  scheme  will  be  used  to  replace  the 
f^(x,t)F(x,yC (t) ,t)/c  term  by  the  rest  of  Af  modulo  0(e).  This 
will  be  done  in  two  steps  by  using  the  f ^  and  f ^  defined  below. 
Proceeding,  define  f ^  by 

OD 

(3.3)  f^(x,t)  -  ~  |  Ej.f^(x,t*s)F(x,yr  (t+s)  ,t+s)ds 

0 

»  e  f  E!.f '  (x,t+e2s)F(x,y  (^+a)  ,t+c2s)ds  ■  0(e). 
j  t  x 

Furthermore,  f^(*,t)  ia  differentiable  in  x  and  f^(-)  = 

f^(xe  (•),*)  €  ^(Ac)  and  (again,  setting  xc  (t)  -  x) 


(3.4)  A'f^ix.t)  -  -f^(x,t)F(x,yl  (t),t)/c  *■  0(t) 

♦  (f  J  (x,t)  )^lG(x,yl  (t)  ,t)  ♦  F(x,y‘ (t) ,t)/cl . 

It  can  be  shown  that  the  gradient  ( f ^ ) x  can  be  obtained  by  dif¬ 
ferentiating  with  respect  to  x  under  the  integral  in  (3.3).  Also 
(f^  (x,t)  )M3(x,y'  (t),t)  -  0(t),  and  we  ignore  this  component 
henceforth.  The  first  term  on  the  right-hand  side  of  (3.4) 
cancels  the  last  term  of  (3.1).  To  get  the  p-lim  result  required 
for  Theorem  1,  we  need  now  only  choose  f^  to  "cancel  the  effect 
of" 


(3.5)  (fj(x,t))^F(x.yfc(t),t)/c  -  f  k(f;(x.fs)F(x,yt(fs).fs))xds 


•  lF(»,y1'  <t>  ,t) ) . 

c 


Define  AQf  (the  average  value  of  (3.5)  -  change  variables 
s/c2  *  s  and  use  the  stationarity  of  y(*))  by 


(3.6)  Ajjf(x,t) 


EF* (x,y (0) , t) |f^(x,t-*-c2s)F(x,y(s),t+c2s) ] yds . 

« 

0 


(3.6)  exists  by  the  strong  mixing  (A3)  and  the  fact  that 
EF(x,y(s),t)  =0.  As  e  *  0,  (3.6)  converges  uniformly  in  x,  t 
to  the  integral  in  (2.4). 

Now,  define  f^(x,t)  by 


ft(x.t)  -  ds  {  du  F '  (x,  yc  ( t+s )  ,  t+s )  l  f (x ,  t+s+u) 

l  J  J  Z  * 

0  0  e 

•  F(x,y(  (t+s+u) , t+s+u) ] ^  -  Aq  f  ( x  ,  t  ♦  s  )  } 

c 

OD  OD 

-  c2rds{  du  E'F'(x1y(^8),tU2s)[fMx(tH2sn2u) 

0  0  C 

•  F  (x,y  (~+s+u)  , t+e2s+e2u) Jx  -  Aq f  (x , t+t  2s )  } 

-  0(c2). 

2 

The  integral  exists  and  equals  0(e  )  by  the  centering  about 
the  mean  value  A^f,  the  strong  mixing  (A3)  and  the  compact  sup¬ 
port  of  f.  Now, 

p- 1 im I f q  ♦  +  f£]  -  0, 

p-lim(Acfc (• )  -  0  /3  t+A) f (xc (“)#*) 3  -  0, 

and  Theorem  1  yields  the  convergence  of  finite-dimensional  dis¬ 
tributions.  The  tightness  argument  is  the  same  as  that  in 
(  8  ,  Theorem  2].  The  proof  concludes  by  noting  that  now  all 
the  conditions  of  Theorem  1  hold.  Q.E.D. 

Unbounded  y(*)  and  form  F(x,y,t)  •  F(x,t)y,  G(x,y,t)  ■ 
G(x,t)  ♦  GQ(x,t)y.  The  treatment  of  the  unbounded  (e.g. 

Gaussian  y(*))  case  is  similar  to  that  of  the  bounded  y(a)  case. 


but  somewhat  more  stringent  conditions  need  to  be  imposed  on 


the  form  of 

F. 

Define 

v(t)  « 

/ E  y(t*s)ds,  where  Ef  denotes  conditioning 

0 

on  y (u) ,  u 

<  t.  Let  there  be  some  p  >  0  such  that 

(AIM 

sup  E  ( 
t  J 

OD 

|  |Ety (t*s) |ds) 2+p  < 

0 

(A2M 

E|y (t) 

|2+p  <  -,  Ey  (t )  =  0, 

(A3M 

sup  E( 

1  i 

OD 

ds | Efcy (t  +  s) v*  ( t+s ) -Ey (t  +  s ) v '  (t+s )  |  ) 2  +  t  <  ®, 

) 

(A4M 

y(M  is  stationary  and  right  continuous, 

(ASM 

F,  G,  Ga  are  continuous  together  with  their  second 

(first 

for  G,  Gq)  partial  x-der ivatives . 

Conditions  (A1M“(A4M  are  satisfied  by  any  process  which 
is  a  linear  combination  of  the  states  of 

(3.7)  du  •  Audt  *  Bdw, 

A  asymptotically  stable,  w(-)  -  Wiener  process.  Since  such 
processes  constitute  the  class  of  Gaussian  processes  with 
rational  spectral  densities,  (A1M“(A4M  are  certainly  not  res¬ 


trictive. 
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In  l  81,  the  unbounded  y(*)  case  was  treated  in  Theorem  5, 
and  GqY  was  not  explicitly  included.  The  proof  there  goes 
through  without  any  additional  conditions  or  difficulty  if  G^y 

is  added  -  provided  that  Gq  has  continuous  x-first  partial 
derivatives.  In  that  proof,  it  was  difficult  to  work  with  un¬ 
bounded  F,  G  when  y(*)  was  unbounded,  so  the  following  assump¬ 
tion  was  added  (adapted  to  our  case  here) . 

N  «•  N  N 

(A6' )  For  each  N,  there  are  functions  F  ,  G  ,  Gq  equal 
to  F,  G  and  GQ,  resp.,  in  SN  -  {x:  |x| <N),  but 
bounded  and  smooth  (as  smooth  as  F,  G,  G^  are) 
out  of  SN,  and  such  that  (A4),  (A5)  hold. 

This  condition  is  normally  satisfied  and  holds  in  our  examples. 

The  reference  to  (A4) ,  (AS)  can  be  dropped  if  the  approach  in  [11] 
is  used  (it  is  then  replaced  by  uniqueness  of  the  solution  to  the 
Ito  representation  of  x(-)). 

Theorem  3.  Under  (A2),  (A1,)-(A6'),  and  xL(0)  ■*  x(0)  in  distri¬ 

bution,  the  finite-dimensional  distributions  of  (xc  ( - ) )  converge 
to  those  of  x ( • ) •  If  y ( • )  is  given  by  a  linear  combination  of 
the  states  of  (3.6),  then  {x'  (•))  is  also  tight  and  {xe  ( - )  }  ■* 
x  ( -  )  weakly  in  Dr(0,°°). 

The  proof  is  similar  to  that  of  Theorem  2.  Given 
f  G  fc*Q  >  we  construct  f  as  in  Theorem  2  and  prove  the  p-lim 
requirements  of  Theorem  1.  See  [  8]  for  the  details  in  the 
non-time-varying  case. 
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Extensions  to  (2.2),  (2.3).  When  F  and  G  depend  on  t , 

3 

the  procedure  is  exactly  the  same.  Given  f  €  ^ q'  •  we  construct 
fl  as  done  in  Theorem  2,  making  sure  that  the  integrals  are  well 
defined  and  of  the  proper  order  in  c,  and  replacing  G,  F  by  G  (  , 

F  .  We  need  EG(  (x,y (0) , t)  -  G(x,t),  a  continuous  function, 
uniformly  on  bounded  (x,t)  sets,  and  that  for  each  f  €  't  q'  , 


(3.8)  f t ( x , t )  ♦  G’  (x,t)fx(x,t)  ♦  |  E  F'(x,y(0),t) 

0 

•  (f(x,t+c2s)F(x,y(s) , t+e2s> )  ds 

X  ^ 

converges  uniformly  on  bounded  (x,t)  sets  to  (3/3t+A) f (x , t) . 

Example  (c)  (Section  6)  require*  a  slightly  different  exten¬ 
sion,  but  the  general  idea  is  the  same. 

4.  The  Phase  Locked  Loop 

The  standard  PLL  is  represented  in  Fig.  2  and  is  ,  perhaps,  the 
simplest  application  of  the  foregoing  ideas.  Via  suitable 
choices  of  D,  E,  C,  Q  all  the  usual  filters  can  be  constructed. 

We  first  do  the  case  D  •  E  ■  C  ■  0,  0-1,  which  yields  the 
standard  form  of  the  first-order  loop  ( 12l •  The  general  case 
is  treated  in  precisely  the  same  way,  and  is  given  below.  The 


i 


v^bOvVEv/ 

z*=Cv*+Qu* 
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function  6(-)  denotes  the  input  phase  process  and  6(-)  its 
estimate,  as  determined  by  the  loop.  Then 

(4.1)  0e  “  ^(sin(O-0t)  +  sin  (0*9l>2uJot)  ]  +  K  cos  (u^t  +  e'  )-n'  (t), 

and  we  have  the  situation  of  either  Theorem  2  or  3,  under  broad 

c  e  2 

conditions  on  y(*),  where  we  use  n  (t)  -  y  (t/c  )/e. 

The  usual  method  of  getting  an  approximate  diffusion  equa- 
tion  proceeds  roughly  as  follows  Il2l*  First  the  "double 
frequency"  terms  are  dropped  (which  is  justified  for  large 
irrespective  of  the  filter  form,  as  we  see  below),  then  it  is 
argued  that  since  6e(*)  varies  "much  more  slowly"  than  nfc(*)»  one 
can  replace  n  (•)  by  a  white  Gaussian  noise  of  the  same  power 
per  unit  BW  that  n' (•)  has  within  the  (say)  pass  band  of  the 
filter. 

Actually,  as  the  BW  of  n'  (•)  increases  (justifying  in  a 
sense  the  "increasing"  independence  of  n'  (t)  and  0c(t)),  the 
magnitude  of  n’  (•)  must  increase  (see  Section  2)  making  the 
replacement  of  n  (*)  by  white  Gaussian  noise  a  little  worrisome. 
In  fact,  it  was  a  "paradox"  arising  from  a  problem  of  this  sort 
which  apparently  led  to  the  original  Wong-Zakai  I  3  )  work. 

The  weak  convergence  method  of  Theorems  1-3  explicitly  yields 
the  correction  term,  and  owing  to  the  nature  of  weak  converg¬ 
ence,  the  distributions  of  many  functionals  of  the  limit  are 
close  to  those  of  xk  (•)»  for  example,  first  exit  times  from 


appropriate  sets.  Indeed,  in  order  to  study  approximations  to 
path  properties  of  xL(-)  via  x(-),  weak  convergence  seems 
to  be  the  appropriate  technique.  This  is  an  important  advantage 
which  the  traditional  methods  do  not  have. 

Define 

OD 

R  »  |  Ey (0 ) y (s ) ds , 

0 

and,  in  order  to  fix  ideas,  let  y  ( * )  satisfy  the  noise  conditions 
of  either  Theorem  2  or  3.  All  the  other  conditions  of  these 
theorems  hold.  Then  the  theorems  yield  that  {0L('))  converges 
weakly  to  the  process  6  ( •  )  with  the  operator  0/3  t+A)  given  by 

(4.2)  (3/3t+A)f  (0,t)  -  f  t  (0  #  t ) 

♦  fx(0,t)  l^sin(0-0)+sin(0+0+2u>ot)l 

♦  fx(0,t)  (-K2Rcos  (u>Qt+0)  sin  (u)Qt+0)  ] 

+  K2Rcos2  (wQt+®^xx  ^  ^  • 

The  quantity  2R  is  roughly  the  power  per  unit  BW  of 

nc ( • )  for  small  c.  Also,  as  e  -  0,  /nE <s)ds  5  xc (t)  converges 

0 

to  a  Wiener  process  with  infinitesimal  covariance  2R.  To  see 
this,  set  fc*  ■  n‘  ,  and  use  Theorem  2  or  3  as  appropriate,  to  get 
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that  xC  converges  weakly  to  a  process  x(')  with  infinitesimal 
operator  0/3  t+R  32/3x2),  i.e.,  to  a  diffusion 

dx  -  /2R  dw 

where  w(*)  is  a  standard  Wiener  process. 

From  the  form  of  (4.2)  it  is  easily  seen  that  the  limit 
9 ( • )  is  a  Markov  diffusion  with  an  It6  process  representation. 

In  particular,  there  is  a  standard  Wiener  process  B(*)  such 
that  6 ( • )  is  represented  by 

(4.3)  d0  -  y^(sin (0-6)+sin (O+0+2wot) )dt 

-  K2R  cos  (u)gt+0 )  sin  (uigt+0 ) dt  ♦  K/2R  cos (wQt+0 )dB. 

The  "correction"  term,  the  second  one  on  the  right,  is  not 
accounted  for  by  the  traditional  analysis,  and  arises  due  to  the 
non-independence  of  © c  ( t )  and  n’  (t).  It  is  proportional  to  K2R. 
For  large  power/unit  BW  of  n' (■),  or  large  system  gain,  this 
term  might  be  of  importance. 

The  general  rth-order  loop.  For  general  D,  E,  C,  Q  in  Fig. 

r 

2,  9  *  Cv  +  Qfr.h.s.  of  (4.1))  and  the  limit  process 

(v (•),&(•))  is  representable  by  an  It6  equation  of  the  form 

v 

(4.4)  d<  )  - 

e 


<£>v  dt  ♦  (q) (r.h.s.  of  (4.3)) . 
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The  result  for  the  general  filter  is  just  as  easy  to  get  as  the 
result  for  (4.1),  since  the  general  filter  only  affects  the  G 
term  (in  the  notation  of  (2.1). 

The  limit  as  w  -  «.  We  consider  (4.3)  as  -  ».  The 
same  result  holds  for  (4.4).  It  is  not  hard  to  see  that  the  two 
middle  terms  of  the  right  side  of  (4.3)  should  disappear  as 
-*  <=■>,  but  it's  a  little  harder  to  see  what  to  do  about  the 
cos(u>0t  +  e)  coefficient  of  dB,  since  this  coefficient  depends  on 

9.  The  result  will  be  the  "traditional"  one,  but  it  is  often 
dangerous  to  use  heuristic  methods  to  treat  problems  involving 
"products  of  white  noise  and  state  variables".  Write  the  solu¬ 
tion  to  (4.3)  as  e(wQ,*).  We  have  the  following  theorem. 

Theorem  4.  (0(ujq,*))  is  tight  in  D[0,<*>)  and  as  -*  ®,  it 

converges  weakly  to  the  process  0(*)  given  by 

a  A  K  A  _ 

(4.5)  d0  =  sin (0-0 )dt  +  K/R  dB. 

Proof .  Apply  Theorem  1  directly.  The  indexes  the 

sequence  rather  than  t.  The  proof  will  be  outlined  only.  The 
—0 - 

state  is  x  (t)  =  {  ).  All  functions  (cos  (u>gt+0)  , 

sin(Wgt  +  0),  etc.)  in  (4.3)  are  Lipschitz  continuous  in  0,  uni¬ 
formly  in  t,  uig,  and  all  are  bounded.  From  this,  we  can  easily 
show  that  there  is  a  constant  C  such  that 

E  |  0  (wq  ,  t+s ) -0  (u>Q,  t)  <  Cs^,  all  t,  s,  u>q  . 
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ey 


[2,  Theorem  12.3],  this  implies  tightness  of  (§(wq,*)).  Let 


denote  conditioning  on  B(s),  s  t. 
Next,  fix  f  €  tf,1'3 


U), 


0 


Def ina 


*  <t>  *  f? 


Etf  (8  (uiQ,t^s)  )  ,  t)  ds  . 


w0  -0 

Then  it  can  be  shown  that  f  (•)  v  £i^(A  )  and  that 


E^f (0(wn,t+2»/uo) ,t)-f (e(wQ,t),t) 
A  °f  (t)  -  - 


2”/^, 


2r/u>, 


J  Etft  (6  WQ.t*s)  ,t)ds 


Owing  to  the  uniform  Lipschitz  condition,  we  can  show  that 


p-limjf  (•)  “  f (8 (w_ ,*).') ]  ■  0, 


p-lim  (A  *  °f  0  ( • )  -  0/9t+A)f  (§(•),•)!  -  0, 


by  which  Theorem  1  guarantees  convergence  of  finite-dimensional 
distributions.  This,  together  with  the  tightness,  guarantees 
the  weak  convergence.  Q.E.D. 


* 
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5.  Adaptive  Antenna  Arrays  [13),  [14],  [15) 

For  another  illustration  of  the  general  idea,  we  consider  a 
standard  problem  in  adaptive  antenna  arrays.  Let  the  array  have 
r  elements  and  input  vector  zt(-)  •  {  z  j  C  • )  »  j*l, • • • ,r),  where 
z^(-)  -  s  ^  { * )  ♦  n|j  ( • )  and  s^(t)  -  A  cos  (w0t-nt  ^  ) ,  where  the  U^) 
are  assumed  known  (known  signal  transmission  direction) .  The 
input  to  the  jth  (each  j)  antenna  is  split  into  two  parts,  one 
part  passing  through  an  ideal  */2  phase  lag  device  whose  output 
we  denote  by  z^C)  -  n^(-)  ♦  s^-).  The  2r  outputs  are  weighted 
and  added  to  yield  the  "array  output"  x'  (•). 

Define  Z  -  ,  z^,...,z^)  =  (z  ,z  ),  denote 

the  respective  weights  by  W  »  (w.,...,wr,  w^,...,wr>  * 

(w,w)  and  set  S  ■  (s^,...,sr,  ■  (s,i). 

The  object  is  to  adaptively  adjust  W  in  order  to  adaptively 
maximize  the  signal-to-noise  power  ratio  in  the  output 
X  ■  w*z  ♦  w’z'  ■  W '  Z  .  Again,  for  convenience,  suppose  that 
the  noise  takes  the  form  n'  (t)  -  y(t/c2)/c  =  y'  (t)/t.  Let  MQ 
and  M*  denote  the  covariance  matrices  of  the  vectors  (y(0),y(0)) 
and  (n’  (0),  hc(0)),  resp.  Then  Mc  •  MQ/c2.  Assume  that  >  0 
(in  the  sense  of  positive  definite  matrices).  Then  the  optimum 
weight  vector  equals  WQ  «  kfi^Sg,  where  k  ^  0  is  any  constant 
and  SQ  -  (cosij,  j^r,  sin*^,  Define  Y1  (•)  -  (y ’  (•),yt(’))» 

Y  ( •  )  -  (y  ( * )  ,y  ( '  )  )  and  m'  (•)  -  Z€  ( • )  (Zf  ( • ) )  • .  Then  ( • )  = 

EMg(-)  -  Mq/c 2  ♦  (s  (• ) , s  (• ) )  (s  (• ) ,s  (• ) ) ' .  The  scheme  of  Fig. 

3  is  a  standard  method  [14],  [15]  of  adaptively  approximating 
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the  optimum  W Q.  The  describing  ordinary  differential  equation 
is  (g,  are  positive  constants,  t  is  a  time  constant)  (5.1), 
where  W(*)  should,  we  hope,  converge  to  something  "close  to"  WQ . 

(5.1)  iWl  -  -w'  -  gZlXL  ♦  gQS0 

-  -IgZL  (Zl  )  •♦I)W’  ♦  gQS0 

A  standard  method  of  treating  (5.1)  (see,  e.g.  (141  -  or 
papers  in  (  IS))  involves  first  dropping  the  signal  component 
of  1  ,  and  then  arguing  as  follows:  since  M*  (•)  is  wide-band 
and  W  (•)  is  much  smoother  than  m’  (•),  the  two  are  essentially 
independent,  so  assume  this,  take  expectations  in  (5.1)  and 
replace  (5.1)  by  the  resulting  equation  (5.2),  which  ought  to 
be  (approximately)  an  equation  for  the  mean  value  V?1  ( • )  of 
Wc  ( • )  . 

(5.2)  tW*  -  -IgMSlJW*  ♦  gQS0. 

The  asymptotic  solution  to  (5.2)  is  W '  -  gQ IgM  ♦I  1  which  is 

close  to  the  optimal  value  if  g  is  large. 

From  a  mathematical  point  of  view,  there  are  some  difficul¬ 
ties  with  this  line  of  reasoning  -  even  allowing  for  the 
usually  justifiable  neglect  of  the  s(*)  terms  in  (5.1).  As  the 
BW  of  n  (•)  increases,  thereby  "justifying"  the  "almost  indepen¬ 
dence"  assertion,  the  covariance  M*  must  also  increase  (see 


Section  2),  so  it’s  not  immediately  clear  what  one  can  say  about 


the  expectation  of  the  product  of  M*  (t)  and  W  (t) .  To  see  the 
problem  more  clearly,  consider  the  scalar  case  tWfc  - 
- (g (n£ (s) ) 2  +  l) WL*g0SQ  (where  we  set  s(*)  -  0),  solve  it  and  take 
expectations  to  get 

I  t 

*  » 

I  (5.3)  EWC  (t)  -  E  (exp  -  (g  (nL  (s) )  %l)ds)Wc  (0) 

0 

t  t 

♦  ds  E  exp  -  | [g (nc (u) ) 2+lJdu  90S0' 

0  s 

which  can  differ  considerably  from  the  solution  to  (5.2)  for 
snail  e. 

We  now  set  the  problem  up  in  a  way  that  admits  an  asymptotic 
analysis  (as  e  -*  0).  Clearly  g  must  be  inversely  proportional 
to  e  for  otherwise  M1  -*  ~  as  t  -•  0.  Suppose  that  an  automatic 
gain  control  mechanism  of  some  sort  is  available  and  that  we 
obtain  an  estimate  of  the  power  in  M  (•),  which  is  proportional 
to  1/e2.  Thus,  let  g  *  t2K  for  some  K  >  0.  Write  6Mg ( • )  ■ 

Mg  ( •  )  -  EMg { • ) ,  and  rewrite  (5.1)  in  the  form 

(5.4)  tW€  -  -{e2K*gE  +  c2 K&Mg  ♦  IJW1  ♦  gQS0 


-  - ( KM-  ♦  c2KSS’  ♦  CK (Yf S’+SYC * )  ♦  K (YCYC ’ -MQ) +1 )W 
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Define  6MQ ( • )  -  Y  ( • ) Y '  ( • ) -MQ  and  6M^(t)  -  &M0(t/c2).  Then 
it  is  clear  that  (Y { • ) , 6MQ < • ) )  plays  the  role  of  the  noise 
y(*)  of  Theorems  2  and  3.  Theorem  3  (extended  to  (2.2))  is  applic¬ 
able  here,  and  its  conditions  are  better  than  those  of  Theorem  1 
for  this  case.  If  y(*)  satisfies  the  (reasonable)  conditions  of 
Theorem  3,  then  {W'M*)}  is  tight  and  converges  weakly  to  the 
solution  of  (5.5)  as  e  *  0.  Here  the  limiting  diffusion  is 
degenerate  because  there  are  no  1/c  terms  in  (5.4): 

(5.5)  TW  -  -  ( KMQ+ 1 ) W  g0SQ. 

This  type  of  argument,  with  the  appropriate  scaling  of  g,  justi¬ 
fies  the  end  result  of  the  traditional  treatment,  namely  going 
from  (5.1)  to  (5.2).  Note  that,  owing  to  the  degeneracy  (the 
limit  is  an  ODE) ,  the  f^  component  of  fL  in  the  proofs  of 
Theorems  2  or  3  is  not  needed,  and  owing  to  this  (A3')  can  be 
dropped  from  Theorem  3.  We  note  in  passing  that  the  scaling 
g  -  (K/average  power)  is  often  used  in  practice  due  to  "dynamic 
range”  considerations.  So  our  scaling  conforms  with  practice  - 
even  if  this  particular  practice  is  not  traditionally  used  in 
the  development  of  (5.5),  it  is  actually  required  for  its  jus¬ 
tification  . 

First-order  noise  effects.  The  system  (5.4)  can  readily 
be  centered  and  scaled  in  order  to  get  the  first-order  noise 
effect.  Define  U*  (•)  ■  (W*  (•)-W(-))/e»  A  comparison  of  the 


following  development  with  that,  say,  in  [14]  reveals  some  of 
the  mathematical  shortcomings  of  the  usual,  more  heuristic 
approach .  Then 

(5.6)  be  -  -K[M0+1]U‘  -K(Yl  S'+SY1  ' )Wl -  KtSS’W 

-  K(6Mq/c)W  -  K6MqUC ,  UL(0)  -  0. 

Theorems  2  or  3  can  be  applied  and,  again,  (Y ( • ) , 6MQ { • ) )  plays 
the  role  of  y(<)  in  those  theorems.  If  (Y ( • ) , 6Mg ( • ) )  satisfies 
the  conditions  on  the  y ( * )  of  those  theorems,  then  { U '  (•))  is 
tight  and  converges  weakly  to  a  process  U ( - )  with  the  Ito  equa¬ 
tion  representation 

(5.7)  TdU  -  -K[H0-*-I)Udt  ♦  K  dB, 

where  B(*)  is  a  non-standard  Wiener  process  whose  covariance  can 
be  obtained  from  the  (a^)  in  the  operator  A  in  (2.4)  in  the 
following  way. 

The  operator  (A+3/dt)  is  given  by 
(A+a/3t)f(U,t)  -  ft(u,t)  ♦  r(U,t)  l-K(M0+I)U]  ♦  (5.8),  where 

(5.8)  is  the  integral  term  in  (2.4)  (note  W ( - ) 


is  not  random) 
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(5.8)  K‘ 


* 

EW* 


(t) 6Mq (0) f uu (Uf  t) 6Mq (s) W (t ) ds 


■  K  trace  f  (U 
uu 


w 

,t)  |  E6MQ(0)W(t)W'(t)  6Mq  ( s  )  ds 


(If  the  (a^tt)}  in  (5.8)  is  not  symmetric,  then  symmetrize  it 
so  that  a^  ■  a^.)  The  "infinitesimal"  covariance  of 
B (t+dt) -B (t)  is  (a^jttJJdt.  Then,  to  first-order  terms  and  with 
wide-band  input  noises,  W(t)  -  W(t)  ♦  cU(t).  Note  that  the  limit 
equation  (5.7)  does  not  have  a  "correction"  term  since  the  1/e 
term  in  (5.6)  does  not  involve  Uc .  The  lack  of  a  "correction" 
term  is  not  a  priori  obvious,  however. 


Convergence  on  [0,“).  Normally,  the  part  of  U  (•)  that  is 
of  most  interest  is  the  "tail".  We  would  like  to  know,  for  example, 
that  the  distributions  of  Uc(t),  t  >  T,  are  close  to  those  of 
the  stationary  solution  to  (5.7)  for  small  e  and  large  enough 
T.  Weak  convergence  does  not  quite  give  this  type  of  result. 
However,  in  this  case,  a  useful  result  is  not  hard  to  get.  We 
only  state  it  -  the  details  of  proof  of  a  similar  case  are  in 

(  9  ). 

If  y(*)  is  a  bounded  process,  then  it  can  be  shown  that 


(5.  9) 


sup  E | U  ( t ) | 2  is  bounded 

t>0,  c  small 


(5.10) 


[UL (T+ • )  ,  T>0,  t  small)  is  tight  in  D2r[0,®) 

and  Ue(T*-)  tends  weakly  to  the  stationary  solution  to  (5.7),  as 
e  -►  0  and  T  -*  ■*>  in  any  way  at  all. 

A  proof  is  in  l  9  1.  (5.8)  is  the  key,  and  is  obtained 

via  a  Liapunov  function  stability  analysis  of  (5.6). 

The  Liapunov  function  V*  ('),  constructed  according  to  a 
method  in  (  9  ]  ,  has  the  form  Vc  (• )  «  V  ( •  )  ♦  ( • )  ♦  ( • )  ♦ 

( '  ) *  where  V ( * )  is  a  Liapunov  function  for  the  deterministic 
system  (5.5)  and  V1  is  a  perturbation  calculated  from  V ( • >  more 
or  less  the  way  fl  was  calculated  from  f ( * )  in  Theorem  2.  The 
ability  to  obtain  results  of  the  type  stated  in  (5.8)  and  below 
it  is  a  very  useful  byproduct  of  the  method  discussed  in  this 
paper.  In  fact,  the  traditional  method  of  analysis  of  this  prob¬ 
lem  assumes  some  sort  of  asymptotic  stationarity  and  stability  [14]. 


6.  Filtered  Hard  Limited  Signal  Plus  Noise 

For  the  final  example,  we  consider  the  case  of  a  continuous 
signal  plus  noise  s(t)  +  n*  (t)  passing  through  a  hard  limiter 
(level  L)  followed  by  a  band  pass  filter.  In  a  classical  paper, 
Davenport  (16)  treated  a  form  of  this  problem  where  s(t)  ■ 

A  cos  pt  and  n*  (•)  had  total  power  N  and  a  spectrum  in  a  fixed 
band  centered  around  p.  He  obtained  specific  values  for  the 


ratio 
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(6.1)  l (signal  power)  /Noise  power  out  -  /[(signal  power).  /Nj 

in  a  band  around  p  in 

in  the  two  limiting  cases  of  N  ■*  0  and  N  ♦  <*>. 

Our  assumptions  are  a  little  different.  Here,  s(*)  is  an 
arbitrary  continuous  function.  Again  we  set  nf  (t)  -  y(t/e2)/e 
and  pass  s(t)  +  n'  (t)  through  a  hard  limiter  and  then  through 
any  filter  which  has  a  linear  differential  (or  even  nonlinear, 
if  we  wish)  equation  representation.  The  stochastic  differen¬ 
tial  equation  which  represents  the  output  is  derived,  and  from 
it  we  can  readily  obtain  a  limit  value  for  an  input-output  ratio 
similar  to  (6.1).  In  order  to  keep  the  notation  simple,  we 
first  suppose  that  the  limiter  is  followed  only  by  an  integrator. 
As  in  Section  4,  the  general  case  is  handled  in  exactly  the 
same  way;  the  form  of  the  filter  does  not  affect  the  method. 

The  output  xl (•)  is  given  by 

-  K  sign (s  It)  +  nc(t)), 

where  K  is  a  scale  factor  whose  value  will  not  affect  the  power 
ratios.  In  fact,  it  is  convenient  to  use  *  L/e,  which  we  will 
do.  Then 

*c  *  ^  sign  [s  (t)  +  y(t/c2)/e). 


(6.2) 
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Although  (6.2)  differs  from  the  forms  (2.1)  used  in  Theorems  2 
and  3,  particularly  because  of  the  1/t  factor  appearing  both 
inside  and  outside  the  sign  function,  Theorem  1  can  still  be 
used  and  a  proof  similar  to  that  of  Theorem  2  gets  the  correct 
limit  and  the  construction  of  the  f'  (•).  We  go  through  some 
of  the  details  below,  in  order  to  illustrate  the  versatility 
and  robustness  of  the  technique.  Since  x  (•)  is  not  involved 
as  either  an  argument  or  coefficient  of  the  sign  function,  the 
scheme  is  not  hard  to  use. 

To  facilitate  computation,  we  let  y(-)  be  Gaussian  with 
correlation  function  o  exp  -ajt|  (a  >  0).  It  will  be  shown 
that  as  c  -*  0,  x'  (•)  converges  weakly  to  a  process  x(*)  which 
has  the  ltd  representation 

(6.3)  dx  -  UTfr  (5-i-ti.) dt  ♦  iVi  in  2/a  dB. 

o 

If  a  filter  of  the  form  used  in  Fig.  2  follows  the  limiter  (where 
we  set  Q  *  0  to  avoid  white  noise  in  the  output) ,  then  the  limit 
equation  is 

(6.4)  dv  *  Dvdt  ♦  Edx 


z(t)  *  output-  Cv(t). 
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Input-output  signal- to-noise  ratios.  The  integrated  input 

noise  /  n'(s)ds  converges  weakly  (as  in  Section  4)  to  a  Wiener 

0  2 
process  whose  covariance  at  time  t  is  2o  t/a  *  2Rt  « 

CD 

2t  /  Ey(0)y(s)ds.  Thus,  as  c  -*  0,  the  input  power  per  unit  BW 

0  2 
(in  any  finite  frequency  range)  converges  to  2o  /a.  In  order  to 

get  a  concrete  power  ratio  comparison  with  Davenport's  result, 
set  s(t)  •  A  cos  pt  here  only.  Consider  the  form  (6.4)  where  the 
system  dimension  and  D,  E,  C  are  chosen  to  get  a  good  approxima¬ 
tion  to  a  zonal  filter  whose  pass  band  includes  p.  The  filter 
gain  is  unimportant  since  it  does  not  affect  the  ratio,  so  we 

assume  that  it  is  unity  in  the  pass  band.  The  noise  power  per 

2 

unit  BW  in  dx/dt  is  L  2  In  2/a,  the  limit  output  noise  power 

2  2  2 

per  unit  BW  as  c  -*  0 .  The  signal  power  in  dx/dt  is  L  A  /no  , 
the  limit  output  signal  power  as  e  •*  0.  Thus 


(Signal/Noise  power  per  unit  BW)  ./(Signal/Noise  power  per 

unit  BW) . 

in 

(6.5)  -  (iV,/(iLLia -^/(jl/^  .  , 

T JO 

which  is  slightly  greater  than  Davenport's  (1}  limit  ratio  (as 
his  S  *  •)  of  w/4. 

This  closeness  of  the  two  results  is  very  pleasing.  Since 
our  assumptions  are  different,  it  suggests  that  our  scheme  might 
yield  results  that  are  meaningful  under  other  circumstances 
where  similar  averaging  phenomena  occur.  In  our  case,  the 
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input  noise  energy  per  unit  BW  is  held  constant  and  the  BW 
increased.  In  116),  the  BW  is  held  fixed  and  the  power  per  unit 
BW  -*  <*>  (to  get  the  n/4  limit  ratio).  In  Il6 )  »  the  ratio  seems 
to  decrease  as  the  input  noise  power  increases,  which  is  consis¬ 
tent  with  our  result  2/n  In  2  >  n/4,  since  our  power/unit  BW  is 
held  fixed.  The  "averaging"  phenomena  in  both  cases  are 
similar  -  in  that  the  existence  of  the  limit  makes  implicit  use 
of  the  "wild"  fluctuations  and  "large"  magnitude  of  the  noise. 

In  a  promising  study  currently  under  way,  a  phase-locked 
loop  with  a  saturator  like  non-linearity  is  being  studied  and 
compared  (favorably)  to  the  more  standard  systems.  Asymptotic 
methods  such  as  described  here  are  used.  They  seem  to  be  the 
only  available  tool. 

Now  an  outline  of  the  proof  that  the  xc(*)  of  (6.2) 
converges  weakly  to  the  solution  of  (6.3)  will  be  given.  The 
proof  for  the  general  filter  case  with  limit  (6.4)  is  about  the 

same . 


Theorem  4 .  Let  s ( • )  be  continuous ,  y ( • )  Gaussian  with  covariance 
a2  exp  -  a | t |  and  mean  zero.  Then  (x"(*))  is  tight  and  as  c  *  0, 
converges  weakly  to  the  process  x ( - )  given  by  (6.3)  (integrator 
only  used)  or  (6.4)  (general  filter  following  the  limiter  used). 

Proof .  We  stick  to  the  integrator  case.  The  general  case 
requires  only  carrying  an  extra  "drift"  term,  and  is  done  in 
exactly  the  same  way.  denotes  conditioning  on  y(u/e^),  u  <  t. 
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Part  1.  Set  y(*)  -  oza(*),  where  za  ( • )  has  correlation 
exp  -a | t J  and  let  z  denote  a  random  variable  with  the  normal 
N(0,1)  distribution.  The  factor  L  is  unimportant,  so  set  L  «  1 
here.  Wq  evaluate  G^Cs)  ■  E  sign (s+oz/e ) .  Then 

(6.6)  G(  (s)  -  [erf(£-)  -  erf(^)]  -  |  c  ♦  o(e) 

where  o(c)/c  -  0  as  c  -  0,  uniformly  in  s  in  any  bounded  set. 
Define  G(s)  3  /27*  ls/o)  and 

Fl  (s.  <jza(t)/t)  -  [  sign  ( s+oza  (t)  / 1)  -  e  sign  (s+oza  ( t ) A)  ] 

-  (sign (s+oza(t)/t) ]  -  Gfc(s) . 


Then 


(6.7) 


c  -  /2 7=.  -  *  0(c)  ♦  Fk  (s , 0za (t A  ) A) A< 


Some  details  will  be  omitted.  We  note  that  F^la,t)  is  p-right 
continuous  since  y(')  has  a  continuous  density  *  similarly  with 
other  functions  below  for  which  this  property  is  needed. 

The  aim  now  is  to  apply  Theorem  1.  Given  f  €  • 

(f*  }  must  be  found  such  that  the  "p-lim"  requirements  of 
Theorem  1  hold.  The  other  conditions  of  Theorem  1  are  satis¬ 
fied,  where  A  is  the  operator  of  the  process  (6.3).  The  method 
of  proof  of  Theorem  2  will  be  used  to  get  both  {fc}  and  A. 
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S’milarly  to  the  situation  in  Theorem  2,  f‘  will  have  the 
form  (no  f q  is  needed,  since  G  is  not  random)  fl (t)  ■ 
f(xc(t))  ♦  f i  ( x 1  (t),t)  ♦  f^tx'  (t),t).  Setting  x  -  x  (t) , 
s  *  s(t)  for  notcticnal  simplicity,  we  have 

(6.8)  ALf(x,t)  -  ffc(x,t)  ♦  fx(x,t)  IG(  (s)  i  Fc  (s,  01*1  t/t2)/t)/cj  . 
Define 


fj(x.t)  -  |fx  (x,t  +  u)E^F  (• ( t*u ) , cza  (^— )/t  )du 

0  C 

OD 

•  c  f  (x ,  t*t  2u)  E^F  (s  (t+c^u)  ,  oza  (^y*-u)/t  )du. 


Owing  to  the  fact  that  F  is  "centered"  about  its  expectation 
and  to  the  exponential  correlation  of  y(*),  the  integral  exists 
and  p-lim  fjL  -  0.  Also,  fj  G  fj*(Ac)  and  (use  x  -  xe(t)> 


(6.9)  Acf[(t)  -  -fx(x,t)Fc  (,(t) ,  cza(t/c2)/t)/t 


4  -  [  fvv(X,t  +  U)E.CF  (s(t  +  u),02a(^4^)/c)du 
c  j  x  x  v  c  _  • 

0 

F  (a  (t) , oz* (t/e2)/e ) 

*  t - - -  ♦  Ge(s(t))). 


1 
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Calculations  such  as  showing  fj^  €  I/(AL)  are  not  hard  here, 

since  F  does  not  depend  on  the  state  x.  Similarly  for  below. 

The  first  term  of  (6.9)  cancels  the  last  term  of  (6.8) 

(which,  of  course,  is  the  reason  for  introducing  f^(*)).  The 

2 

integral  of  (6.9)  exists  and  equals  (change  variables  u/c  -  u) 


'6.1°)  fx„  (x,t  +  .  2u)Ej.F  |s(tn2u),oza(Vu)A)du 


,’F  (s(t),oza  (t/t2 )  /c )  ♦tG(.  (s  (t ) )  ) 


The  tG(  term  goes  to  zero  in  the  p-liir.  sense  as  c  -*  0,  and  it 


is  ignored  henceforth.  Still  following  the  method  of  Theorem 
2,  let  Ag f (x , t )  denote  the  expectation  of  (6.10)  (minus  tne 
tG  term).  Then  (the  integral  exists,  again  by  the  centering 
of  F  am 
of  y ( ■ ) ) 


of  F  and  the  exponential  decrease  in  the  correlation  function 


Aff(x,t)  -  If  v (x, t  +  e^uJEF  (c (t i c2u) ,oza (u)/c)Fc (s  (t) ,oz“  (0)/c)du 
0  ^  xx  * 


Next,  define  f^(x,t),  the  "centered  and  averaged"  last  term  of 


(6.9)  (minus  the  Gf  term),  by 


f  2  ( x ,  t )  - 


d  v{  j  dv  (fxx  (x,  t+u+v)  ^-jE(tFL  (s  (t+u+v)  ,°za  (**jp*y)  ) 


0  0 


F(  ( S  { t  ♦  V  )  ,fza(iy^))l-  Agf  (X,t+V)  ). 


It  can  be  shown  that  the  integral  exists, 
the  centering  about  the  mean  Agf(x,t«-v)  of  the 

and  the  exponential  correlation  function.  Via 

2  2 

change  cf  variables  u/c  -  u#  v/»  >  v,  we  get 

E }  f  2  c  X  #  t )  I  -  OU2).  Also  fi;  €5>(A'  )  and  (use 


again  by  use  of 
inner  integral , 
the  usual 
that 

x  -  xc(t)) 


A' f ^  (x, t )  ■  minus  first  term  on  r.h.s.  of  (6.10)  plus 
terms  whose  (absolute)  expectation  is  0(- ) 
♦  Apf (x, t ) . 


Then,  concluding, 

(6.11)  p-limlf(-)  -  flxc(-),*)J  -  0, 

p-lim[Acfe (’)-ft(xe (•) ,*)-Ajf  (xc(-),*)-Gt  (s)fx  (xL(-).*)J  -  0. 

In  Parts  2  and  3  below,  it  is  shown  that  ((6.12)  defines  AQ ) 
f  (x,t)ln  2 

(6.12)  A^f(x,t)  ■*  - -  5  A^f(x,t)  uniformly  in  x  for  each  t. 


This,  together  with  G(  (s)  -*  G(s)  uniformly  on  bounded  s-seta 
(hence  fxG^  -  fxG  uniformly  in  x  for  each  t)  yields  the  theorem,  since 
the  process  of  (6.3)  is  the  unique  process  corresponding  to  the 
operator 


U/3t  ♦  A)  -  (^  ♦  G(s(t))|jj  ♦  y(- 


ln 


dx 


a 
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Part  2.  Evaluation  of  AQf.  Let  i{‘)  denote  a  Gaussian 
process  with  correlation  function  exp  -|t|.  Changing  variables 
u(old)  ■*  u(new)/a  yields 

A^ffx.t)  -  Jljtx.t) 

where 

QD 

(6.13)  lj(x,t)  -  |  f xx (x,  t+e2u/a ) [Qc  (t ,  u)  -  Rc(t,u)]du, 

0 

where 


0  (t,u)  »  E  sign (s (t+uc2/a)  ♦  ^  z  ( u )  )  sign  (s  (t)  -*^z  (0)  J  , 
R*"  (t,u)  -  E  sign  Is  (t+^— )  •♦-— z  (u)  ]  E  sign  Is  (t ) -*-7-2  (0)  1  . 

a  t  t 


Owing  to  the  properties  of  the  joint  distribution  of 
T 

(z  (0 ) i z  (u) } 1  ^  j  QL ( t ,u) -Rl (t,u) |du  is  bounded  uniformly  in  e  and 
T  and  converges  uniformly  in  e  is  T  ♦  (In  fact,  the  integrand 

goes  to  zero  at  an  exponential  rate  as  u  *•  “.)  Using  this  and 
the  smoothness  and  compact  support  of  f xx ( • , • ) ,  we  can  replace 
f  (x,t+e2u/a)  by  f  (x,t)  in  l!:(x,t)  without  altering  the  limit 

XX  XX  » 

as  c  *  0. 

By  the  above  arguments*  if  QL (t ,u) -R  (t , u)  has  a  limit  for 
each  t  as  t  *  0 ,  then 


(6.14)  lim  Aq  f  fx,t) 
e-0 


1 

a 


0 


lim  [Qc (t , u) -Rc ( t « u ) ] du 
e -0 


and  also  that  in  order  to  show  (6.12)  it  is  enough  to  show  that 
the  integral  on  the  right  equals  In  2.  First  the  existence  of 
the  limit  will  be  shown.  Let  s*  -  s(t+c^u/a)  and  s  -  s(t). 

Then 

Ql ( t , u )  ■  P  {* (u) >-s*c/o , Z (0) >-f c/o }+P {* (u)  <-s*c/c, z (0) <-sc/o ) 
-P (2  (u! >-s* t/c,  z (0)  <-s. /o  )-P  (z  (u)  t/o , Z  (0)  '-Sl/o  } 

Rt(t,u)  •  [P{  z  (u)  >-s\/o  }-P{  z  (u)  <-s*e/o }  1 

•  (P{z (0) >-ec/o)-P( Z (0) <-8t/o) ) . 

Obviously  as  c  ■*  0,  R*  (t,u)  -*  0  (even  uniformly  on  bounded 
s,  s * ,  t,  u  sets,  although  we  don't  need  this).  Also  (even 
uniformly  as  above) 

QC  (t,u)  -»  P(  z  (u)  >0  ,  z  (0)  >0  )  ♦  P(  z  (u)  <0  ,  z  (0 )  <0  ) 

-  P(z (u) <0,z (0) >0  )  -  P{ z (u) >0 , z (0) <0  ) 

-  2  I P { z (u) >0,z (0) >0  )  -  P(z (u) <0,z  (0) >0)) 

5  2J(u)  . 


Define 
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J  (u) du. 


Then  we  have  proved  that 


Agf  (Xf  t) 


2f  (x,t) 


xx 


J  (u)  du  - 


Agf ( X , t ) 


uniformly  in  x  for  each  t.  We  need  only  evaluate 


Part  3.  Proof  that  JQ  -  (In  2)/2.  Use  polar  coordinates 
ir.d  write  p  -  e~u.  Then  the  joint  density  of  (z(0),z(u))  is 


- l-  ^  y/2  («*P  -r2g(8)/2)r, 

2-  (l-o  )  ' 


g(‘>)  *  5lcos20-2csin0cose  +  sin2ei  -  j( l-psin2® ]  . 
1-p2  l-o 


Also 


n/2 


J  (u)  ^  P  (z  (0)  >0,  z  (u)  >0} 


(u)  :  P  (z  (0!  '0,  z  (u)  <0  ) 


1  I 


- r .  .  jk  (exp  g  (0)  )drd9, 

o  o2-*1-0* 

0  2 
■  [  f  - r  k  r-ry  (exp-£— g  (9)  )drd6. 

J/2  i  2MWV71  ^ 


Integrating  with  respect  to  r  yields 


'/2 


J  (u)  ■ 


dOJl-P2) 


- 3  T?T 

2f  (1-0  )  '  (1-Psin29) 


ihVi! 


i  f»  a,.  _ _ 

J  J  l-Psin'1'  5" 

0 


By  (16,  eqn.  298  1  , 


I 
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